Organisation Name
University of São Paulo
Industry
Health services
Organisation Website
Country
Brazil
Sustainable Development Goals (SDGs)
SDG 3: Good Health and Well-being
SDG 10: Reduced Inequality
General Description of the AI tool
BioAutoML automatically runs an end-to-end ML pipeline that can be effectively employed by non-experts. To the best of our knowledge, our proposal automates the longest pipeline for biological sequence analysis, encompassing feature engineering, ML algorithm recommendation, and hyperparameter tuning. So far, we have achieved promising results on several problems, such as SARS-CoV-2, anticancer peptides, pro-inflammatory peptides, HIV sequences, and phage virion proteins. BioAutoML lowers the barrier for non-experts, democratizing ML in life sciences.
Github, open data repository
https://github.com/Bonidia/BioAutoML
Relevant Research and Publications
1 — [IF 2021: 13.994] BONIDIA, ROBSON P; SANTOS, ANDERSON P AVILA; DE ALMEIDA, BRENO L S; STADLER, PETER F; DA ROCHA, ULISSES N; SANCHES, DANILO S; DE CARVALHO, ANDRÉ C P L F. BioAutoML: automated feature engineering and metalearning to predict noncoding RNAs in bacteria. Briefings in Bioinformatics, v. 1, p. 1-13, 2022.
2 — (Core of BioAutoML, fundamental for its operation) [IF 2021: 13.994] BONIDIA, ROBSON P; DOMINGUES, DOUGLAS S; SANCHES, DANILO S; DE CARVALHO, ANDRÉ C P L F. MathFeature: feature extraction package for DNA, RNA and protein sequences based on mathematical descriptors. Briefings in Bioinformatics, v. 1, p. 1-10, 2022.
3 — [IF 2021: 2.738] BONIDIA, ROBSON P; SANTOS, ANDERSON P AVILA; DE ALMEIDA, BRENO L S; STADLER, PETER F; DA ROCHA, ULISSES N; SANCHES, DANILO S; DE CARVALHO, ANDRÉ C P L F. Information Theory for Biological Sequence Classification: A Novel Feature Extraction Technique Based on Tsallis Entropy. Entropy, v. 24, p. 1398, 2022.
4 — Google Latin America Research Awards (LARA), Google, 2021. Project: BioAutoML. Elected by LARA-Google among the 24 most promising ideas in Latin America (24 awarded projects, from a base of 700 submissions).
5 — BioAutoML — Finalists (Top 15 of 82) — Falling Walls Lab Brazil 2022, DWIH São Paulo, Falling Walls Foundation, DAAD The German Center for Science and Innovation.
For other solutions developed using BioAutoML, explore our website and GitHub repository. These resources showcase diverse applications and advancements made possible by BioAutoML's innovative AI capabilities.
Link-1: https://bonidia.github.io/BioAutoML-WP/
Link-2: http://autoaipandemics.icmc.usp.br/
Link-3: https://github.com/Bonidia/
Needs
Funding
Public Exposure
HPC resources and/or Cloud Computing Services