Genomic Data Machined: The Random Forest Algorithm for Discovering Breast Cancer Biomarkers
Year:
2023Published in:
SpringerAdvanced data analysis tools and bioinformatics are essential foruncovering the nature of breast cancer, which is the leading cause of cancer deathamong women. The goal of this study is to identify potential genomic biomark-ers that have a significant impact on four prognostic factors, including tumoursize, lymph node involvement, metastasis, and overall survival status. The Ran-dom Forest algorithm has been trained on data from The Cancer Genome AtlasBreast Cancer, which contains the expression values of 19,737 genes. In orderto obtain the optimal learning model, the process has been repeated 20 times foreach indicator, and only the genes with a p-value <0.05 were taken into furtherconsideration. Several performance metrics (e.g., F1 score) were calculated tocheck the algorithm’s reliability. As a result, 97 and 7 genes were included in theextended and final databases, respectively. The chosen genes have been provento play a critical role in cancer-related pathways, such as Toll-like receptor andNF-κB, and have effects on cell proliferation, tumour formation, and angiogene-sis. Thus, this study demonstrates the potential of machine learning analyses forbiomedical purposes and provides machine-generated insights into breast cancerdevelopment, setting the groundwork for further in vitro examinations to validatethe prognostic potential of these biomarkers
Related by author
18 publications found
COVID‑19 vaccinations and rates of infections, hospitalizations, ICU admissions, and deaths in Europe during SARS-CoV‑2 Omicron wave in the first quarter of 2022
Publisher: John Wiley & Sons
Authors: Nadiia Kasianchuk, D. Sikora, B. Poniedziałek, P. Rzymski
Random Forest Algorithm in Unravelling Biomarkers of Breast Cancer Progression
Publisher: Anhalt University
Authors: Nadiia Kasianchuk, Dmytro Tsvyk, Eduard Siemens, Halina Falfushynska
Illicit Drugs in Surface Waters: How to Get Fish off the Addictive Hook
Publisher: MDPI AG
Authors: Nadiia Kasianchuk, Halina Falfushynska, Yuliia Faidiuk, P. Rzymski, Anastasiia Boshtova, Piotr Rychter
Multimarker Responses of Zebrafish to the Effect of Ibuprofen and Gemfibrozil in Environmentally Relevant Concentrations
Publisher: Springer Nature
Authors: Nadiia Kasianchuk, Halina Falfushynska, O. Horyn, O. Bodnar, D. Poznanskyi
Scrutinised and compared: HVG identification methods in terms of common metrics
Publisher: ICAIIT
Authors: Nadiia Kasianchuk, Yevhenii Kukuruza, Vladyslav Ostash, Anastasiia Boshtova, Dmytro Tsvyk, Matvii Mykhailichenko
OXIDATIVE DAMAGE IN ZEBRAFISH EXPOSED TO ENVIRONMENT REALISTIC CONCENTRATIONS OF ROUNDUP AND CHLORPYRIFOS
Publisher: STEF92 Technology
Authors: Halina Falfushynska, Nadiia Kasianchuk, O. Bodnar, O. Horyn, I. Khatib
The effects of ZnO nanostructures of different morphology on bioenergetics and stress response biomarkers of the blue mussels Mytilus edulis
Publisher: Elsevier
Authors: Nadiia Kasianchuk, Joydeep Dutta, Sergey Dobretsov, Inna Sokolova, Fei Ye, Fangli Wu, Halina Falfushynska
The biomedical potential of tardigrade proteins: A review
Publisher: Elsevier
Authors: Nadiia Kasianchuk, P. Rzymski, Kaczmarek
Pooled biochemical and mRNA markers of the blue mussels Mytilus edulis
Publisher: PANGAEA
Authors: Inna Sokolova, Nadiia Kasianchuk, Halina Falfushynska, Fangli Wu, Fei Ye, Joydeep Dutta, Sergey Dobretsov
Network analysis identified novel disease module in rheumatoid arthritis
Publisher: Biopolymers and Cell
Authors: Nadiia Kasianchuk, P. Fau, A. Romanenko, A. Kompaniiets, P. Havrysh, Matvii Mykhailichenko, D. Nishchenko, F. Iordachi, O. Petrenko