EDITORIAL
What is already known about the subject?
- Medical genetic research aimed at identifying genetic conditioning or predisposition to diseases has become relevant recently due to the trend towards personalized medicine.
- Establishment of biobanks and biobank networks in various fields of biomedicine are modern tools that increase the efficiency and quality of scientific research.
What might this study add?
- The review summarizes the data of numerous publications on genetic research based on biosample collections and biobank databases. The role of Russian biobanks in the development of this area is described. It is due to biobanks that large-scale projects have been carried out, allowing to create genetic risk scores, diagnostic systems, to obtain data on associations of genetic variants with pathological conditions in order to implement personalized approaches to medicine.
To implement a modern personalized approach in practical healthcare, the latest biomedical technologies should be developed and genetic research should be performed. The analysis of a substantial quantity of data is essential for the investigation of the prevalence of genetic risk factors for various diseases, drug resistance genes, the development of genetic panels to determine the individual risk of pathologies, as well as the creation of genetic risk scores. The review demonstrates through the use of illustrative examples that contemporary biobanks have become a vital component in the field of genetics research, both in Russia and globally. These specialized institutions are capable of accumulating, storing, and utilizing a substantial quantity of biological samples and related data, which is essential for advancing genetic research. The data collected in biobanks and associated clinical information form the basis for large-scale genetic studies conducted in different countries. The efficacy of genetic advancements, such as the early diagnosis of diseases, is contingent upon the number of biobanks, the establishment of collaborative networks among them, and the capacity to leverage digital platforms uniting diverse databases. Biobanks and biobanking have emerged as the foundation for the advancement of personalized medicine.
ORIGINAL ARTICLE
What is already known about the subject?
- Study of rare and unusual clinical cases is associated with a number of difficulties due to complicated creation of a cohort of the size required for the study.
- Currently developing biobanking technologies give new opportunities for clinical and epidemiological research.
What might this study add?
- The proposed approach to preserving biospecimens and related data in patients with rare clinical cases of scientific interest is important and effective for solving a number of personalized medicine problems.
- The proposed approach allows solving a wide range of clinical and research problems in personalized medicine as follows: timely diagnosis or clarification; formation of patient cohorts to study the genetic aspects of diseases; identification of new genetic variants of hereditary diseases; development of genetic diagnostic panels; study of rare diseases; reduction of sample creation time in case of novel scientific ideas.
Aim. To present an approach to collection of biosamples of patients with rare and scientifically interesting clinical situations for solving the problems of personalized medicine, as well as to analyze related 8-year experience.
Material and methods. The approach and a collection of biosamples of blood and its derivatives is developed at the National Medical Research Center for Therapy and Preventive Medicine within the project "Interesting Cases at the National Medical Research Center for Therapy and Preventive Medicine". The collection of biomaterial from project patients was carried out on a planned basis with the inpatient department, as well as upon referral from outpatient doctors. All included patients signed informed consent. Each biosample is accompanied by an extensive annotation, including socio-demographic, clinical, genetic and other types of data. The article presents the project results as of August 12, 2024.
Results. An expert group developed 15 disease groups and related inclusion criteria. At the time of analysis, 4525 inpatients and outpatients were included in the project. Positive changes in the number of people included annually is noted. Genetic testing was performed on >2500 patients. The proposed approach allows solving a wide range of clinical and research problems in personalized medicine as follows: timely diagnosis or clarification; formation of patient cohorts to study the genetic aspects of diseases; identification of new genetic variants of hereditary diseases; development of genetic diagnostic panels; study of rare diseases; reduction of sample creation time in case of novel scientific ideas.
Conclusion. The proposed approach to the collection and preservation of biosamples and related clinical, socio-demographic, genetic and other types of data in patients with rare clinical cases of scientific interest is important and effective for solving practical and research problems of personalized medicine. The algorithm is well developed, standardized and easily implemented within the clinics, regardless of their size. Preanalytical phase standardization creates the prerequisites for multicenter national and international cooperation.
What is already known about the subject?
- Biobanking is the basis for research on tumor-associated markers in biological fluids.
- Small non-coding ribonucleic acid molecules (microRNA) are the most promising circulating biomarkers of glial tumors.
What might this study add?
- A collection of plasma samples from patients with primary and secondary brain tumors was created.
- A fraction of microRNA was extracted from the plasma and subjected to next-generation sequencing to search for tumor-specific glioma markers.
Aim. To create a collection of plasma samples of patients with brain tumors (BTs) for the development of a diagnostic microRNA (ribonucleic acid) panel of glial tumors.
Material and methods. Plasma samples of patients with benign and malignant BTs were obtained by double centrifugation of whole blood and then frozen at -75оС.Fifty-nine RNA samples isolated from blood plasma were analyzed by next-generation sequencing (NGS).
Results. Currently, the biobank contains samples from 339 patients with primary and secondary BTs and 10 control group individuals (698 samples — 2 plasma aliquots per individual), including 143 men and 206 women. The age of the patients ranged from 19 to 91 years (median — 56 years). Primary BTs (41%) included two following groups: benign (33,7%) and malignant (66,3%). Meningiomas constituted the bulk (91%) of the benign BTs. Among the malignant tumors, glioblastomas (46,7%) and astrocytomas (41,6%) prevailed, while oligodendrogliomas and ependymomas accounted for only 9,1 and 2,5%, respectively. Secondary BTs (59%) are represented by recurrent glial tumors (92,5%) and metastatic tumors (7,5%) of lung cancer (71,4%) and breast cancer (28.6%). A protocol for the primary preparation of liquid biopsy samples was implemented, which made it possible to obtain high-quality deoxyribonucleic acid libraries for the selected microRNA profiling platform.
Conclusion. The creation of a plasma sample collection is the basis for searching circulating biomarkers of BTs.
What is already known about the subject?
- The biobank is a universal and reliable high-tech platform for processing, long-term storage, extraction of biomaterial for genetic and biochemical analysis in cardiovascular patients.
- Elevated levels of lipoprotein transport and metabolism markers specify the cardiovascular prognosis and the effectiveness of lipid-lowering therapy in patients with coronary artery disease.
What might this study add?
- A biomaterial collection of the Biobank of Yugra laboratory made it possible to conduct a single-center prospective study of biochemical markers in a cohort of patients with myocardial infarction living in the Khanty-Mansi Autonomous Okrug.
- Serum levels of lipoprotein transport and metabolism markers (PCSK9, lipoprotein(a), triglyceride-glucose index), determined in patients with acute myocardial infarction, are reliably associated with clinical factors, comorbidity, vascular atherosclerosis and a negative prognosis in the post-infarction period.
Aim. To evaluate clinical associations and predictive value of lipoprotein transport and metabolism markers determined in a collection of serum samples from patients with myocardial infarction (MI).
Material and methods. Collection of blood samples from 88 patients with the acute MI was created in the Biobank of Yugra laboratory for subsequent biochemical assessment of serum levels of lipoprotein transport and metabolism markers. All patients were included in a clinical prospective study for 48 months with registration of medical events and remote tomographic assessment of coronary artery (CA) involvement upon follow-up completion.
Results. Direct associations were established between proprotein convertase subtilisin/kexin type 9 (PCSK9) levels and low-density lipoprotein cholesterol levels, coronary atherosclerosis, Global Registry of Acute Coronary Events (GRACE) 2.0 risk score, and the risk of recurrent acute coronary syndrome. Lipoprotein (a) levels >50 mg/dl were detected in 28,4% of patients with MI and were associated with prior medical events, coronary and non-coronary artery atherosclerosis, comorbidity, recurrent coronary lesions, and the risk of major cardiac events at the end of follow-up. A relationship was established between a high (5,14) triglyceride-glucose index and comorbidity, recurrent coronary lesions, and the death risk at the end of follow-up.
Conclusion. The Biobank of Yugra laboratory is an effective base for laboratory research. Lipoprotein transport and metabolism markers are associated with clinical factors, comorbidity, vascular atherosclerosis and a negative prognosis in patients with MI.
What is already known about the subject?
- Genomic sequence databases are an independent intellectual resource in demand by many researchers.
- The Virus Genome Aggregator of Russia (VGARus) database was created to deposit information obtained during molecular genetic studies in Russia.
What might this study add?
- VGARus makes it possible to monitor the mutational variability of viruses, to carry out operational and retrospective analysis of their prevalence in Russia.
- Depositing information on the Epstein-Barr virus sequences in VGARus makes it possible to study its variability and impact on epidemic process, to develop countermeasures, including specific immunoprophylaxis.
Despite the fact that databanks of genomic sequences have clear information links with biological samples, they are an independent intellectual resource in demand among many researchers. To deposit information obtained in the course of molecular genetic studies in Russia, the Virus Genome Aggregator of Russia (VGARus) databank was created.
Aim. To evaluate data on the variability of genetic sequences of the Epstein-Barr virus (EBV), obtained as a result of Russian studies, and the possibility of their integration into the VGARus database for
subsequent use in epidemiological surveillance.
Material and methods. A search for publications was carried out in the PubMed, eLibrary, Cyberleninka databases. In total, the study included 32 papers of Russian and external authors that meet the stated aim. Results. The number of studies containing data on the EBV genetic sequences in the Russian Federation is extremely small. The information presented in publications indicates geographical differences in the ratio of two EBV genotypes, the presence of a special LMP1 TatK gene variant in the Tatars of the Volga region, the difference between Russian EBV samples in the gene encoding the gp350 from those from other world regions. At the same time, information on the genomic sequences obtained in the studies was not deposited in the Russian gene bank in any case.
Conclusion. Expanding the potential of the Russian VGARus platform by including information on the genomic sequences of all pathogenic microorganisms circulating in the Russian Federation will require largescale work that takes into account technical features, biological and information security requirements.
What is already known about the subject?
- Population biobanks contain unique information about clinically significant deoxyribonucleic acid (DNA) markers in the indigenous population. However, the incidence rates are presented not for the indigenous population, but for the general population of the regions.
What might this study add?
- A technology has been developed for assessing the DNA marker frequencies in Russian regions based on data on the gene pools of the indigenous peoples of Russia and ethno-demographic information in regions.
- The developed approach allows for a quantitative assessment of the correlation of clinically significant DNA markers with incidence rates in any administrative units, and for calculations for any regions and periods with a known ethnic composition and incidence.
Information on morbidity is presented in statistical reports for the entire population of multinational subjects of Russia, but population Biobanks contain information on individual peoples.
Aim. To develop a technology for assessing the frequency of deoxyribonucleic acid (DNA) markers for administrative units based on indigenous population data, which will make it possible to assess the relationship between the frequency of DNA markers and morbidity.
Material and methods. The Biobank of Northern Eurasia presented the frequencies of 24 DNA markers associated with cardiovascular disease (CVD) for 86 indigenous peoples of Russia (6227 DNA samples). Information on the ethnic composition of 83 Russian regions was extracted from the 2010 census, and the CVD incidence in 85 Russian regions (2020-2021) was assessed by data from the Ministry of Health of Russia.
Results. The data on the gene pools of the indigenous population and the ethnic composition of Russian regions made it possible to assess the frequencies of 24 DNA markers in 85 Russian regions. The obtained frequencies allowed us to directly assess their relationship with CVD in Russia — chronic rheumatic heart diseases, cardiomyopathy, coronary artery disease, hypertension, acute myocarditis. Technology application data are presented in the form of tables and genogeographic maps of the distribution of 24 CVD DNA markers in Russian subjects, the geography of morbidity in Russian subjects for five CVDs, and the correlation of the distribution of the frequency of CVD DNA markers and the incidence of CVD among the population of Russia.
Conclusion. The proposed technology for assessing the modern gene pool in Russian subjects based on the data on the gene pool of the indigenous peoples of Russia for the first time made it possible to assess the relationship between clinically significant DNA markers and morbidity in Russia.
What is already known about the subject?
- A significant proportion of spontaneous abortions (miscarriages) in euploid pregnancy are associated with pathogenic genetic variants in the fetus.
- Large-scale collections of samples provide a unique opportunity to assess the allele frequency in the population.
What might this study add?
- RUSeq database makes it possible to initially assess the inheritance rate of the main miscarriage-related variants.
- In a healthy Russian population, both known and new genetic variants in 10 genes potentially associated with miscarriage have been discovered.
Aim. Modern large-scale biocollections and related open databases play a critical role in the development and implementation of novel approaches to prevention and diagnostics, as well as in improvement of treatment of hereditary pathologies. The aim of this study was to analyze the carriage rate of miscarriage-related variants in the Russian population presented in the RUseq database.
Material and methods. The first Russian open database of genetic variants and their rate in the Russian population (RUSeq) was used as the main source of information on allele frequencies. We analyzed 270 known genetic variants described as a cause of miscarriage. A search for pathogenic variants in 18 key miscarriage-related genes was conducted.
Results. We revealed that 10 out of 270 variants described as a miscarriage cause are found in the Russian population. In addition, 46 known or new potentially pathogenic variants were found in 10 key genes that are possible markers of miscarriage risk. In one case (NEB gene), the cumulative frequency of such variants exceeded 0,5%.
Conclusion. The obtained results emphasize the importance of genetic databases and the need for further study of miscarriage-realted gene disorders, as well as the inclusion of identified variants in preconception genetic testing programs for couples in order to determine pregnancy planning and management.
What is already known about the subject?
- The short tandem repeat (STR) profiling is one of the simplest and most convenient methods for cell line authentication, but interpretation of the results can be complicated by analysis artifacts.
What might this study add?
- The work describes for the first time in Russian the common analysis artifacts discovered during the experimental work using the STR profiling, as well as the possible reasons for their occurrence. In addition, recommendations are given for reducing the influence of the mentioned artifacts on the analysis results.
Aim. To study and describe the most common types of artifacts in detection of short tandem repeat (STR) amplicons by capillary electrophoresis and cause difficulties in interpreting the obtained STR profiles.
Material and methods. Cell lines were obtained from the bioresource collection of cell lines of the Blokhin National Medical Research Center of Oncology. DNA was isolated according to the manufacturer’s instructions of the DNeasy Blood & Tissue (QIAGEN, Germany) and ExtractDNA Blood & Cells (Evrogen, Russia) kits. DNA concentration was measured using a Qubit 4.0 device (Thermo Fisher Scientific, USA) and a Qubit dsDNA BR Assay Kit (Thermo Fisher Scientific, USA). Multiplex PCR was performed using a COrDIS EXPERT26 reagent kit (Gordiz, Russia). Capillary electrophoresis of PCR products was performed on a 3500xL Genetic Analyzer (Applied Biosystems, USA). GeneMapper Software v6.0 (Thermo Fisher Scientific, USA) was used to process electrophoresis data.
Results. The most well-known artifacts associated with the STR profiling and subsequent capillary electrophoretic separation of amplicons were studied. Cases of detection of these artifacts from personal practice are given. Recommendations for improving the electrophoresis pattern are given.
Conclusion. The paper studies the artifacts of analysis in cell line STR profiling by capillary electrophoresis (STR-CE), which researchers encounter in laboratory practice. Common types of analysis artifacts that cause difficulties in interpreting the results obtained during STR profiling, as well as possible reasons for their occurrence, are described in detail and illustrated with examples from our own practice. Recommendations are given for reducing the number of non-specific fluorescent signals and their intensity.
ПРАКТИЧЕСКИЙ ОПЫТ
What is already known about the subject?
- Obstetrics research in recent decades has rarely considered the physiology of pregnancy, but has mainly focused on complications.
- The dynamism of changes during pregnancy over a short-limited period of time makes it impossible to perform a cross-sectional assessment of a pregnant woman's body, and determines the need to implement a novel longitudinal biobanking strategy.
What might this study add?
- Two fundamentally different strategies for biobanking samples have been formulated as follows: cross-sectional and longitudinal banking.
- The collection creation using the longitudinal biobanking strategy makes it possible to study the patterns of physiological changes during pregnancy and search for early predictors of pregnancy complications.
The article highlights the experience of the "Genofund" biobank of the Ott Research Institute of Obstetrics, Gynecology and Reproductology in bioresource collection in modern obstetrics. To create different types of collections, the following different biobanking strategies were formulated for the first time: cross-sectional and longitudinal (dynamic). Definitions are given for these approaches. The most significant advantages and disadvantages of each of them are noted. Examples of pilot studies conducted on the platform of a bioresource collection created using the complex longitudinal biobanking are given. This approach made it possible to demonstrate, using the example of microRNA analysis, the potential of studying the physiological changes during pregnancy at the present stage, as well as the prospects for finding early predictors of pregnancy complications.
What is already known about the subject?
- When creating cell culture biobanks for the production of high-tech drugs and for conducting studies, researchers encounter difficulties with insufficiently developed regulatory framework, the absence of strict criteria for culture quality control.
What might this study add?
- The developed methods, including quality assurance and control, made it possible to create the largest collection of cultures of mesenchymal stromal cells, fibroblasts, early-passaged human umbilical vein endothelial cells in the North-West region, consisting of certified samples that meet existing cell culture requirements for the production of high-tech drugs, as well as for research.
Modern research in cell biology places high demands on the model objects used, including cell cultures. These requirements are dictated by the need to avoid artifacts in work and reduce potential risks to the environment associated with biological hazard. The article presents the experience of creating cell culture bank for personal and public storage at the Pokrovsky Cell Technology Center. The main points of managing the collection, incoming quality control, certification, processing, preparation for storage and storage of human biological materials are considered. The developed approaches and work algorithms made it possible to create a collection of early passage cultures consisting of certified samples that meet the requirements for cell cultures for the manufacture of high-tech drugs and biomedical cell products, as well as for research.
What is already known about the subject?
- RNA sequencing is currently the priority method for identifying differentially expressed genes and obtaining valuable information for assessing pathological conditions.
- Identification of master regulator (MR) genes can be the basis for developing novel therapies.
What might this study add?
- The fundamental potential of RNA sequencing with MR identification in choosing targeted therapy for patients with colorectal cancer is shown.
- Therapy prescribed without taking into account MR analysis may not be optimal.
Aim. To study the applicability of RNA (ribonucleic acid) sequencing with master regulator identification for predicting the effectiveness of targeted therapy in patients with colorectal cancer.
Materials and methods. Tissue samples from three patients with colorectal cancer obtained from postoperative material were used. All patients received palliative antitumor therapy in standard regimens in accordance with the tumor location and the status of RAS and BRAF gene mutations. The transcriptome of tumor and healthy tissue of each patient was sequenced, and master regulators in the tumor tissue were analyzed.
Results. A list of master regulators was found for each patient and possible therapeutic agents most suitable for suppressing the tumor process were predicted.
Conclusion. The potential of computer analysis of the molecular profile of colorectal adenocarcinoma in predicting the effectiveness of therapy is shown. However, to determine the clinical potential of this technique, a study on a wider sample is required.
What is already known about the subject?
- Preanalytical factors for processing and storing blood plasma have a direct impact on cell-free deoxyribonucleic acid (cfDNA) studies.
- Clear boundaries have not been defined for the maximum permissible excess of freeze-thaw cycles and plasma storage time when using stabilization tubes with a preservative compared to EDTA tubes for noninvasive prenatal testing (NIPT).
What might this study add?
- NIPT using stabilization tubes without observing the preanalytical factors for plasma storage (long-term storage and multiple freeze-thaw cycles) cannot be considered a sufficient criterion for assessing the good quality of preanalytical plasma processing.
- Most studies describing manipulations with cfDNA recommend a single plasma freeze-thaw (which is associated with the use of EDTA tubes); however, studies conducted with plasma stabilized in tubes with a preservative make it possible to assume that these parameters (long-term storage and multiple freeze-thaw cycles) should be adjusted upward.
Aim. To assess non-invasive prenatal testing (NIPT) as an informative criterion for quality of blood plasma and cell-free deoxyribonucleic acid (DNA) (cfDNA) in the case of using stabilization tubes at the preanalytical phase, in the example of plasma samples subjected to long-term storage at room temperature (+18о С) and multiple freezethaw cycles.
Material and methods. The plasma samples were subjected to 20 freeze-thaw cycles (-80о С/+18о С), 20-day storage at +18о С with an intermediate cfDNA assessment. The quantitative yield was assessed by fluorometry, while the fragmentation and NIPT data — using realtime polymerase chain reaction (PCR) and high-throughput sequencing, respectively.
Results. After multiple freeze-thaw cycles and long-term plasma storage at room temperature (+18о С), a decrease in the integrity and the concentration of cfDNA by the 20th thawing cycle, as well as a tendency to an increase in concentration by 20 days of storage were observed. Despite this, the NIPT results of the studied samples showed a high degree of coincidence with the NIPT data of the reference samples.
Conclusion. The mere fact of successful NIPT cannot be considered as a reliable and sufficient criterion for assessing the quality of initial plasma and correct preanalytics. This emphasizes the particular importance of monitoring the conditions for transporting and storing plasma and whole blood samples.
REVIEW ARTICLES
What is already known about the subject?
- MicroRNAs (small non-coding ribonucleic acid molecules) are involved in post-transcriptional regulation of gene expression and affect various pathological conditions.
- Standardization of preanalytical variables is a critical step in clinical implementation as well as in research of circulating microRNAs.
What might this study add?
- Maximum stability of microRNA is achieved by storing serum and plasma at -80оC while minimizing the number of freeze-thaw cycles.
Over the past decade, circulating small non-coding ribonucleic acid molecules (microRNAs) have demonstrated their potential as minimally invasive diagnostic and prognostic biomarkers of various diseases. Standardization of preanalytical and analytical factors, including collection, processing and storage of biosamples, plays a significant role in the reliability and reproducibility of circulating microRNA quantification. To date, there is no consensus regarding the data normalization used in the analysis of circulating microRNA expression. The review aim is to consider modern original papers on various storage conditions of biobanked plasma and serum samples with subsequent isolation of circulating microRNAs for analysis.
What is already known about the subject?
- Biobanks are an important integral tool of modern research projects in medicine.
- Large studies with comprehensive collections of high-quality biological samples in combination with clinical, paraclinical and socio-demographic data are necessary for a better understanding of the etiology and pathogenesis of cardiovascular diseases.
What might this study add?
- Priority areas for the use of biobanks in cardiology are shown.
- Research using biobank resources is necessary to study the pathogenetic mechanisms of cardiovascular diseases, as well as to search and analyze polygenic risk factors and predictors of adverse cardiovascular events and conduct clinical trials.
A systematic review of publications from the PubMed and eLibrary. ru databases, Biobanking and Biomolecular Resources Research Infrastructure — European Research Infrastructure Consortium (BBMRI-ERIC) and ClinicalTrials.gov studies was carried out for 15 years. The aim was to find priority areas for the use of biobanks in cardiology. The key areas of research on blood and heart tissue biobanks are the study of pathogenetic mechanisms, creation of innovative methods for diagnosis, treatment and prevention of cardiovascular diseases (CVDs). The use of modern technologies such as genomics, transcriptomics, proteomics and metabolomics allows identifying candidate markers, revealing new molecular targets for drug therapy, diagnostic and therapeutic approaches for CVD. One of the promising areas is the search and study of polygenic scores of CVD risk and predictors of adverse cardiovascular events. Analysis of the registry revealed another important area of biobank application — clinical trials, in which biobanks are a key resource of blood and tissue samples, as well as clinical, paraclinical, and socio-demographic data. Therefore, studies using biobank resources are necessary to study the pathogenetic mechanisms of CVD, identify new proteomic biomarkers and genetic factors, as well as to improve diagnostics, prevention and treatment.
What is already known about the subject?
- Cell-free microRNAs are present in plasma and serum and can potentially be used as biomarkers of various diseases.
- The preanalytical phase of studies, including collection, processing, storage and transportation of biospecimens, can play an important role in the reliability and reproducibility of circulating microRNA quantification.
What might this study add?
- No standardization of sample processing protocols for the analysis of circulating microRNAs in plasma and serum has a key impact on the reproducibility of study results.
- Experimental data are contradictory and do not make it possible to establish optimal conditions for all sample preparation stages.
Circulating microribonucleic acids (microRNAs) are promising biomarkers of various diseases, but their clinical laboratory use requires highly sensitive, reproducible, reliable and sustainable methods for their accurate plasma and serum quantification. The preanalytical phase of studies conducted using biospecimens consists of their collection, processing, storage and transportation. Preanalytical conditions remain the main distorting factors in microRNA studies, and standardization of these conditions, carried out in biobanks, can improve the reproducibility of results and their comparison. The review aim is to consider the main contemporary original studies on preanalytical factors, which are an important source of variability in studies on circulating microRNAs at the stages from blood collection to plasma or serum production.
What is already known about the subject?
- One of the important functions of small RNA molecules is the post-transcriptional regulation of gene expression.
- With differential expression, RNA molecules of different classes can be identified, including small RNA molecules, which are potential markers of a wide range of diseases.
- Bioinformatics analysis programs differ in operating speed, usability, availability, a set of additional parameters and other characteristics.
- The development of a bioinformatics protocol for analyzing high-throughput sequencing data contributes to the standardization of preanalytical factors, which is an important step in clinical application, as well as research of small RNA molecules.
What might this study add?
- A universal protocol for processing the results of high-throughput sequencing of small RNA molecules, its main stages and the most widely used programs are described.
High-throughput sequencing of small ribonucleic acid (RNA) molecules is widely used to search for markers of various diseases, as well as to study the regulation of gene expression. The data processing protocol consists of many stages, including the stages of analyzing the initial data quality and sequencing results, mapping and studying the expression profile of the detected small RNA molecules. A whole arsenal of programs and specific packages has already been developed to implement each study step. The instrumental composition of the final bioinformatics protocol is critically important for the correct data processing and study reproduction. This review describes the most universal protocol for processing the results of high-throughput sequencing of small RNA molecules, including all the main stages and the most widely used programs.
ISSN 2619-0125 (Online)