CV¶
last revised: 2024-09
Yi Liu, Research Fellow, MRC Integrative Epidemiology Unit, University of Bristol
- GitHub https://github.com/yiliu6240
- Blog https://yiliu6240.github.io
- Emails
  - yi.liu6240[at]gmail.com
  - yi6240.liu[at]bristol.ac.uk
- UoB Research profile https://research-information.bris.ac.uk/en/persons/yi-liu
- ORCID https://orcid.org/0000-0002-2051-440X
- LinkedIn https://www.linkedin.com/in/yi-liu-08b7a1167/
Research interests¶
- Complex network analysis
- Computational and statistical methods in causal inference
- Machine learning and predictive analytics
- Natural language processing
Professional experience¶
2023-08 -- present, Research fellow, MRC Integrative Epidemiology Unit, Bristol Medical School, University of Bristol
2018-11 -- 2023-07, Senior research associate in health data science, MRC Integrative Epidemiology Unit, Bristol Medical School, University of Bristol
I am affiliated with the Data Mining Epidemiological Relationships programme of Tom Gaunt.
2017-06 -- 2018-10, Postdoctoral Research Fellow, College of Social Sciences & International Studies, University of Exeter
I worked on the ESRC-funded project “Inclusion and the academisation of English secondary schools: trends in the placement of pupils with significant SEN and those permanently excluded”.
Projects¶
Education¶
2012-09 -- 2017-07, PhD in Economics, Department of Economics, University of Birmingham
2011-09 -- 2012-08, MSc Money Banking and Finance, Department of Economics, University of Birmingham
2007-09 -- 2011-07, Financial management, School of Economics and Management, China University of Geosciences
Research output¶
Base bibliography style: "American Psychological Association 7th edition" from Zotero
Journal articles¶
Liu, Y., & Gaunt, T. R. (2024). Triangulating evidence in health sciences with Annotated Semantic Queries. Bioinformatics, 40(9), btae519. https://doi.org/10.1093/bioinformatics/btae519
Zheng, J., Lu, J., Qi, J., Yang, Q., Zhao, H., Liu, H., Chen, Z., Huang, L., Ye, Y., Xu, M., Xu, Y., Wang, T., Li, M., Zhao, Z., Zheng, R., Wang, S., Lin, H., Hu, C., Ling Chui, C. S., … Bi, Y. (2024). The effect of SGLT2 inhibition on prostate cancer: Mendelian randomization and observational analysis using electronic healthcare and cohort data. Cell Reports Medicine, 5(8), 101688. https://doi.org/10.1016/j.xcrm.2024.101688
Lloyd, O., Liu, Y., & Gaunt, T. R. (2023). Assessing the effects of hyperparameters on knowledge graph embedding quality. Journal of Big Data, 10(1), 59. https://doi.org/10.1186/s40537-023-00732-5
Liu, Y., Elsworth, B. L., & Gaunt, T. R. (2023). Using language models and ontology topology to perform semantic mapping of traits between biomedical datasets. Bioinformatics. https://doi.org/10.1093/bioinformatics/btad169
Zhao, H., Rasheed, H., Nøst, T. H., Cho, Y., Liu, Y., Bhatta, L., Bhattacharya, A., Hemani, G., Davey Smith, G., Brumpton, B. M., Zhou, W., Neale, B. M., Gaunt, T. R., & Zheng, J. (2022). Proteome-wide Mendelian randomization in global biobank meta-analysis reveals multi-ancestry drug targets for common diseases. Cell Genomics, 2(11), 100195. https://doi.org/10.1016/j.xgen.2022.100195
Zheng, J., Zhang, Y., Zhao, H., Liu, Y., Baird, D., Karim, M. A., Ghoussaini, M., Schwartzentruber, J., Dunham, I., Elsworth, B., Roberts, K., Compton, H., Miller-Molloy, F., Liu, X., Wang, L., Zhang, H., Davey Smith, G., & Gaunt, T. R. (2022). Multi-ancestry Mendelian randomization of omics traits revealing drug targets of COVID-19 severity. eBioMedicine, 81, 104112. https://doi.org/10.1016/j.ebiom.2022.104112
Liu, Y., Elsworth, B., Erola, P., Haberland, V., Hemani, G., Lyon, M., Zheng, J., Lloyd, O., Vabistsevits, M., & Gaunt, T. R. (2020). EpiGraphDB: a database and data mining platform for health data science. Bioinformatics, 37(9), 1304–1311. https://doi.org/10.1093/bioinformatics/btaa961
Zheng, J., Haberland, V., Baird, D., Walker, V., Haycock, P. C., Hurle, M. R., Gutteridge, A., Erola, P., Liu, Y., Luo, S., Robinson, J., Richardson, T. G., Staley, J. R., Elsworth, B., Burgess, S., Sun, B. B., Danesh, J., Runz, H., Maranville, J. C., … Gaunt, T. R. (2020). Phenome-wide Mendelian randomization mapping the influence of the plasma proteome on complex diseases. Nature Genetics. https://doi.org/10.1038/s41588-020-0682-6
Liu, Y., Bessudnov, A., Black, A., & Norwich, B. (2020). School autonomy and educational inclusion of children with special needs: Evidence from England. British Educational Research Journal, 46(3), 532–552. https://doi.org/10.1002/berj.3593
Black, A., Bessudnov, A., Liu, Y., & Norwich, B. (2019). Academisation of Schools in England and Placements of Pupils With Special Educational Needs: An Analysis of Trends, 2011–2017. Frontiers in Education, 4. https://doi.org/10.3389/feduc.2019.00003
Working papers, preprints, conference proceedings¶
Vabistsevits, M., Robinson, T., Elsworth, B., Liu, Y., & Gaunt, T. R. (2022). Integrating Mendelian randomization and literature-mined evidence for breast cancer risk factors. medRxiv. https://doi.org/10.1101/2022.07.19.22277795
Zheng, J., Liu, Y., Elsworth, B., Bronson, P. G., Nounu, A., Haycock, P., Robinson, J. W., Babaei, M. S., Kanzaria, S., John, S., Prins, B., Runz, H., Hurle, M., Hemani, G., Butterworth, A., Scott, R. A., Davey Smith, G., & Gaunt, T. R. (2022). Multi-omics Mendelian randomization powering drug targets prioritization for common diseases [In preparation].
Elsworth, B., Lyon, M., Alexander, T., Liu, Y., Matthews, P., Hallett, J., Bates, P., Palmer, T., Haberland, V., Davey Smith, G., Zheng, J., Haycock, P., Gaunt, T. R., & Hemani, G. (2020). The MRC IEU OpenGWAS data infrastructure. bioRxiv. https://doi.org/10.1101/2020.08.10.244293
Elsworth, B., Liu, Y., & Gaunt, T. R. (2019). Vectology – exploring biomedical variable relationships using sentence embedding and vectors, November 2019. 1st International “Alan Turing” Conference on Decision Support and Recommender Systems.
Conference presentations¶
Liu, Y. (2023). Triangulating evidence in health sciences with Annotated Semantic Queries. 2023 Research Software Engineering in Data Science & AI Workshop, Warwick, UK.
Elsworth, B., & Liu, Y. (2019). Vectology – exploring biomedical variable relationships using sentence embedding and vectors. 1st International “Alan Turing” Conference on Decision Support and Recommender Systems, London, UK.
Liu, Y. (2019). Computing systematic polygenic risk scores associations using biobank-wide association scan. 4th International Mendelian Randomization Conference, Bristol, UK.
Liu, Y. (2019). The effects of the academisation of English schools on educational trajectories of children with Special Educational Needs. 4th International Mendelian Randomization Conference, Bristol, UK.
Liu, Y. (2018). Estimating the causal effects of academisation of English schools with the data from the National Pupil Database. 2018 International Conference for Administrative Data Research, Belfast, UK.
Liu, Y. (2018). The effects of the academisation of English schools on educational trajectories of children with Special Educational Needs. National Pupil Database User Group 2018 Meeting, London, UK.
Liu, Y., & Black, A. (2017). Trends on pupils with special education needs and disabilities (SEND) and the academisation effect. National Pupil Database User Group 2017 Meeting, London, UK.
Software¶
Liu, Y. (2022). MRC IEU wiki [Computer software]. https://mrcieu.github.io/wiki
Liu, Y., & Gaunt, T. R. (2021). MRC IEU GitHub pages [Computer software]. https://mrcieu.github.io
Liu, Y., Elsworth, B., & Gaunt, T. R. (2020). EpiGraphDB software [Computer software]. https://mrcieu.github.io/software/epigraphdb
Liu, Y., Haberland, V., & Gaunt, T. R. (2020). Epigraphdb R package [Computer software]. https://cran.rstudio.com/web/packages/epigraphdb
Liu, Y., & IEU OpenGWAS team. (2019). OpenGWAS reports [Computer software]. https://github.com/MRCIEU/opengwas-reports
Liu, Y., Elsworth, B., & Gaunt, T. R. (2019). Vectology [Computer software]. http://vectology.mrcieu.ac.uk
Academic activities¶
Grants¶
Assessing capability of large AI models in text mining of cancer studies. University of Bristol Cancer Research Fund seedcorn funding. 2023. Liu, Y. (Co-PI), Xu, Z. (Co-PI), Sobczyk-Barad, M., Gaunt, T. R., Simpson, E.
Data acquisition and pilot study on bioRxiv and medRxiv full text data to facilitate comprehensive data mining on biomedical literature. Elizabeth Blackwell Institute Rapid Research Funding Call Award 2023. 2023. Liu, Y. (PI), Gaunt, T. R.
Automation in systematic reviews. WCRF UK. 2023 -- 2026. Liu, Y. (PI 2024 -- present; Co-PI 2023), Sobczyk-Barad, M. (Co-PI 2023), Millard, L., Martin, R., Gaunt, T. R.
Local authority responses to diversity: school placement trends. 2018 -- 2019. BA/Leverhulme Small Research Grants, Sakellariadis, A. (PI), Liu, Y., Black, A., Norwich, B.
Non-grant projects¶
Data extraction of statistical findings from epidemiological and public health studies using LLMs. 2024. Isambard-AI Technical and Preparatory Projects. Gaunt, T. R. (PI), Liu, Y. (Co-I), Xu, Z.
PhD Student supervisions¶
Xuyutian Wang. Supervised by Tom Gaunt, Yi Liu.
Winfred Gatua. Supervised by Tom Gaunt, Yi Liu, Deborah Lawlor, Maria Sobczyk-Barad, Jie Zheng.
Oliver Lloyd. Supervised by Tom Gaunt, Yi Liu, Patrick Rubin-Delanchy. Completed.
Marina Vabistsevits. Supervised by Tom Gaunt, Tim Robinson, Yi Liu, Benjamin Elsworth. Completed.
Referee for Journals¶
Journal of Personalized Medicine; Healthcare; Applied Sciences; Genes; Symmetry; Behavioural Sciences; BioData Mining; Electronics