2025 | Brazil | Health | Outstanding | SDG10 | SDG3
BioAutoML: Democratizing Machine Learning in Life Sciences

Organisation Name

University of São Paulo

Industry

Health services

Organisation Website

https://www.icmc.usp.br/

Country

Brazil

Sustainable Development Goals (SDGs)

SDG 3: Good Health and Well-being

SDG 10: Reduced Inequality

General Description of the AI tool

BioAutoML automatically runs an end-to-end ML pipeline that can be effectively employed by non-experts. To the best of our knowledge, our proposal automates the longest pipeline for biological sequence analysis, encompassing feature engineering, ML algorithm recommendation, and hyperparameter tuning. So far, we have achieved promising results on several problems, such as SARS-CoV-2, anticancer peptides, pro-inflammatory peptides, HIV sequences, and phage virion proteins. BioAutoML lowers the barrier for non-experts, democratizing ML in life sciences.

Github, open data repository

https://github.com/Bonidia/BioAutoML

Relevant Research and Publications

1 — [IF 2021: 13.994] BONIDIA, ROBSON P; SANTOS, ANDERSON P AVILA; DE ALMEIDA, BRENO L S; STADLER, PETER F; DA ROCHA, ULISSES N; SANCHES, DANILO S; DE CARVALHO, ANDRÉ C P L F. BioAutoML: automated feature engineering and metalearning to predict noncoding RNAs in bacteria. Briefings in Bioinformatics, v. 1, p. 1-13, 2022.

2 — (Core of BioAutoML, fundamental for its operation) [IF 2021: 13.994] BONIDIA, ROBSON P; DOMINGUES, DOUGLAS S; SANCHES, DANILO S; DE CARVALHO, ANDRÉ C P L F. MathFeature: feature extraction package for DNA, RNA and protein sequences based on mathematical descriptors. Briefings in Bioinformatics, v. 1, p. 1-10, 2022.

3 — [IF 2021: 2.738] BONIDIA, ROBSON P; SANTOS, ANDERSON P AVILA; DE ALMEIDA, BRENO L S; STADLER, PETER F; DA ROCHA, ULISSES N; SANCHES, DANILO S; DE CARVALHO, ANDRÉ C P L F. Information Theory for Biological Sequence Classification: A Novel Feature Extraction Technique Based on Tsallis Entropy. Entropy, v. 24, p. 1398, 2022.

4 — Google Latin America Research Awards (LARA), Google, 2021. Project: BioAutoML. Elected by LARA-Google among the 24 most promising ideas in Latin America (24 awarded projects, from a base of 700 submissions).

5 — BioAutoML — Finalists (Top 15 of 82) — Falling Walls Lab Brazil 2022, DWIH São Paulo, Falling Walls Foundation, DAAD The German Center for Science and Innovation.

For other solutions developed using BioAutoML, explore our website and GitHub repository. These resources showcase diverse applications and advancements made possible by BioAutoML's innovative AI capabilities.

Link-1: https://bonidia.github.io/BioAutoML-WP/
Link-2: http://autoaipandemics.icmc.usp.br/
Link-3: https://github.com/Bonidia/

Needs

Funding

Public Exposure

HPC resources and/or Cloud Computing Services

CONTACT

International Research Centre
on Artificial Intelligence (IRCAI)
under the auspices of UNESCO 

Jožef Stefan Institute
Jamova cesta 39
SI-1000 Ljubljana

info@ircai.org
ircai.org

FOLLOW US

The designations employed and the presentation of material throughout this website do not imply the expression of any opinion whatsoever on the part of UNESCO concerning the legal status of any country, territory, city or area of its authorities, or concerning the delimitation of its frontiers or boundaries.

PRIVACY  POLICY