Creating bioinformatics matrix of molecular markers to predict risk-associated health disorders

View or download the full article: 
57.044; 616.092

M.A. Zemlyanova1,2,3, N.V. Zaitseva1, Yu.V. Koldibekova1, E.V. Peskova1,2, N.I. Bulatova1


1Federal Scientific Center for Medical and Preventive Health Risk Management Technologies, 82 Monastyrskaya Str., Perm, 614045, Russian Federation
2Perm State University, 15 Bukireva Str., Perm, 614990, Russian Federation
3Perm National Research Polytechnic University, 29 Komsomolsky Ave., Perm, 614990, Russian Federation


Long-term or permanent chemical ambient air pollution in residential areas is among priority factors that cause medical and demographic losses. It is necessary to achieve greater precision when assessing risks of changes in homeostasis at their early reversible stage (molecular level). These changes are highly likely to transform into pathological processes at an older age in case the exposure persists.

Our research goal was to create a bioinformatics matrix of molecular markers to predict risk-associated health disorders (exemplified by a marker of exposure). We introduced a stepwise research algorithm that involved using the proteome technology to identify expressed proteins and cause-effect relations between them and influencing factors; revealing molecular-cellular and functional relationships within the “exposure factor – gene – protein – negative outcome” system to predict risk-associated health disorders. The algorithm was implemented to examine the proteomic blood plasma profile of children aged 3–6 years living under long-term aerogenic exposure to fluoride-containing compounds.

We established certain changes in the proteomic profiles of the exposed children in comparison with non-exposed ones as per 27 identified proteins. A bioinformatics matrix was created on the example of cathepsin L1; we established that changes in the level of this protein had a cause-effect relationship with fluoride ion concentrations in urine. Qualitative synthesis of molecular-cellular localization, functional and tissue belonging showed that cathepsin L1 expression caused by elevated fluoride ion levels in urine could affect extracellular matrix remodeling, degradation and post-translation modification of proteins in cells of the lungs, large intestine, and pancreas, in cardiomyocytes and in glomerular podocytes. It also mediated proteolysis of the subunits of the SARS-CoV-2 S1 protein necessary for the virus penetration into a cell and its replication. This created bioinformatics matrix exemplified by cathepsin L1 made it possible to predict risk-associated negative outcomes in exposed people including cardiomyopathy, colitis, glomerulonephritis, diabetes mellitus, atherosclerosis, and coronavirus infection. These predictive estimates raise effectiveness of early detection and development of preventive measures aimed at minimizing possible negative outcomes.

proteomic profile, children, cellular-molecular and tissue belonging, molecular markers, gene expression, negative effects, fluoride ion in urine, cathepsin L1, bioinformatics resources
Zemlyanova M.A., Zaitseva N.V., Koldibekova Yu.V., Peskova E.V., Bulatova N.I. Creating bioinformatics matrix of molecular markers to predict risk-associated health disorders. Health Risk Analysis, 2022, no. 2, pp. 174–184. DOI: 10.21668/health.risk/2022.2.16.eng
  1. Action plan for the prevention and control of noncommunicable diseases in the WHO European Region. WHO, 2016. Available at:
    WHO-EURO-2016-2582-42338-58618-eng.pdf?sequence=1&isAllowed=y (02.03.2022).
  2. Human biomonitoring: facts and figures. Copenhagen, WHO Regional Office for Europe, 2015. Available at: (02.03.2022).
  3. Miroshnichenko I.I., Ptitsina S.N. Biomarkers in the modern medical and biologic practice. Biomeditsinskaya khimiya, 2009, vol. 55, no. 4, pp. 425–440 (in Russian).
  4. Anderson N.L., Anderson N.G. The human plasma proteome: history, character, and diagnostic prospects. Mol. Cell. Proteomics, 2002, vol. 1, no. 11, pp. 845–867. DOI: 10.1074/mcp.r200007-mcp200
  5. Baer B., Millar A.H. Proteomics in evolutionary ecology. J. Proteomics, 2016, vol. 135, pp. 4–11. DOI: 10.1016/j.jprot.2015.09.031
  6. Frenkel-Morgenstern M., Cohen A.A., Geva-Zatorsky N., Eden E., Prilusky J., Issaeva I., Sigal A., Cohen-Saidon C. [et al.]. Dynamic Proteomics: a database for dynamics and localizations of endogenous fluorescently-tagged proteins in living human cells. Nucleic Acids Res., 2010, vol. 38, suppl. 1, pp. D508–D512. DOI: 10.1093/nar/gkp808
  7. Mi H., Muruganujan A., Thomas P.D. PANTHER in 2013: modeling the evolution of gene function, and other gene attributes, in the context of phylogenetic trees. Nucleic Acids Res., 2003, vol. 41, pp. D377–D386. DOI: 10.1093/nar/gks1118
  8. Baumgartner C., Osl M., Netzer M. Baumgartner D. Bioinformatic-driven search for metabolic biomarkers in disease. J. Clin. Bioinforma, 2011, vol. 1, no. 1, pp. 2. DOI: 10.1186/2043-9113-1-2
  9. Palasca O., Santos A., Stolte Ch., Gorodkin J., Jensen L.J. TISSUES 2.0: an integrative web resource on mammalian tissue expression. Database, 2018, vol. 2018, pp. 1–12. DOI: 10.1093/database/bay003
  10. Mi H., Muruganujan A., Casagrande J.T., Thomas P.D. Large-scale gene function analysis with the PANTHER classification system. Nat. Protoc., 2013, vol. 8, no. 8, pp. 1551–1566. DOI: 10.1038/nprot.2013.092
  11. Sun Z., Li S., Yu Y., Chen H., Ommati M.M., Manthari R.K., Niu R., Wang J. Alterations in epididymal proteomics and antioxidant activity of mice exposed to fluoride. Arch. Toxicol., 2018, vol. 92, no. 1, pp. 169–180. DOI: 10.1007/s00204-017-2054-2
  12. Jormsjö S., Wuttge D.M., Sirsjö A., Whatling C., Hamsten A., Stemme S., Eriksson P. Dif-ferential expression of cysteine and aspartic proteases during progression of atherosclerosis in apolipoprotein E-deficient mice. Am. J. Pathol., 2002, vol. 161, no. 3, pp. 939–945. DOI: 10.1016/S0002-9440(10)64254-X
  13. Tang Q., Cai J., Shen D., Bian Z., Yan L., Wang Y.-X., Lan J., Zhuang G.-Q. [et al.]. Lysosomal cysteine peptidase cathepsin L protects against cardiac hypertrophy through blocking AKT/GSK3β signaling. J. Mol. Med. (Berl.), 2008, vol. 87, no. 3, pp. 249–260. DOI: 10.1007/s00109-008-0423-2
  14. Hsing L.C., Kirk E.A., McMillen T.S., Hsiao S.-H., Caldwell M., Houston B., Rudensky A.Y., LeBoeuf R.C. Roles for cathepsins S, L, and B in insulitis and diabetes in the NOD mouse. J. Autoimmun., 2010, vol. 34, no. 2, pp. 96–104. DOI: 10.1016/j.jaut.2009.07.003
  15. Reiser J., Oh J., Shirato I., Asanuma K., Hug A., Mundel T.M., Honey K., Ishidoh K. [et al.].
    Podocyte migration during nephrotic syndrome requires a coordinated interplay between cathepsin L and alpha3 integrin. J. Biol. Chem., 2004, vol. 279, no. 33, pp. 34827–34832. DOI: 10.1074/jbc.m401973200
  16. Liao M.C., Miyata K.N., Chang S.Y., Zhao X.P., Lo C.S., El-Mortada M.A., Peng J., Chenier I. [et al.]. Angiotensin II type-2-receptor stimulation ameliorates focal and segmental glo-merulosclerosis in mice. Clin. Sci. (Lond.), 2022, vol. 136, no. 10, pp. 715–731. DOI: 10.1042/CS20220188
  17. Cattaruzza F., Lyo V., Jones E., Pham D., Hawkins J., Kirkwood K., Valdez–Morales E., Ibeakanma C.H. [et al.]. Cathepsin S is activated during colitis and causes visceral hyperalgesia by a par2-dependent mechanism in mice. Gastroenterology, 2011, vol. 141, no. 5, pp. 1864–1874.e1-3. DOI: 10.1053/j.gastro.2011.07.035
  18. Zhou N., Pan T., Zhang J., Li Q., Zhang X., Bai C., Huang F., Peng T. [et al.]. Glycopeptide Antibiotics Potently Inhibit Cathepsin L in the Late Endosome/Lysosome and Block the Entry of Ebola Virus, Middle East Respiratory Syndrome Coronavirus (MERS-CoV), and Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV). J. Biol. Chem., 2016, vol. 291, no. 17, pp. 9218–9232. DOI: 10.1074/jbc.M116.716100
  19. Xiong Y., Liu Y., Cao L., Wang D., Guo M., Jiang A., Guo D., Hu W. [et al.]. Transcriptomic characteristics of bronchoalveolar lavage fluid and peripheral blood mononuclear cells in COVID-19 patients. Emerg. Microbes Infect., 2020, vol. 9, no. 1, pp. 761–770. DOI: 10.1080/22221751.2020.1747363
  20. Adedeji A.O., Severson W., Jonsson C., Singh K., Weiss S.R., Sarafianos S.G. Novel inhibitors of severe acute respiratory syndrome coronavirus entry that act by three distinct mechanisms. J. Virol., 2013, vol. 87, no. 14, pp. 8017–8028. DOI: 10.1128/JVI.00998-13
Accepted for publication: 

You are here