2025 | Education | Excellent | Germany | SDG3 | SDG4
Phonomatics

Organisation Name

Leibniz University Hannover

Industry

Education

Organisation Website

https://www.uni-hannover.de/de/

Country

Germany

Sustainable Development Goals (SDGs)

SDG 3: Good Health and Well-being

SDG 4: Quality Education

General Description of the AI tool

The aim of Phonomatics is to develop an automatic speech recognition (ASR) software that transcribes child speech recordings from natural communication situations (e.g., looking at a picture book with another child or adult) and evaluates them linguistically (vocabulary, grammar, articulation). The software output will be various speech and language measures aggregated for different professional users and assessment purpuses. Therefore, our software can be used in educational and health care settings (primarly at kindergarten age) wherever child language development is being monitored.

Github, open data repository

https://www.tnt.uni-hannover.de/en/project/talc/

Relevant Research and Publications

1. Rumberg, L., Gebauer, C., Ehlert, H., Wallbaum, M., Bornholt, L., Ostermann, J. & Lüdtke, U. (2022). kidsTALC: A Corpus of 3- to 11-year-old German Children’s Connected Natural Speech. Proceedings INTERSPEECH — 23th Annual Conference of the International Speech Communication Association, 5160-5164. (Our open access database to promote research on child language technology)

2. Christopher Gebauer, Lars Rumberg, Hanna Ehlert, Ulrike Lüdtke, Jörn Ostermann
Exploiting Diversity of Automatic Transcripts from Distinct Speech Recognition Techniques for Children’s Speech
Proceedings INTERSPEECH 2023 – 24th Annual Conference of the International Speech Communication Association, August 2023
(pdf) BibTeX (Work on ASR of child speech as a prerequisite for a detailed speech assessment)

3. Rumberg, L., Gebauer, C., Ehlert, H., Wallbaum, M., Lüdtke, U. & Ostermann, J. (2023). Uncertainty Estimation for Connectionist Temporal Classification Based Automatic Speech Recognition. Proceedings INTERSPEECH, 4583-4587. (Uncertainty estimation as a method to provide transparent automated outputs)

4. Christopher Gebauer, Lars Rumberg, Lars Köhn, Hanna Ehlert, Edith Beaulac, Jörn Ostermann
Grammatical Error Detection on Spontaneous Children’s Speech Using Iterative Pseudo Labeling
to appear in Proceedings INTERSPEECH 2025 – 26th Annual Conference of the International Speech Communication Association, ISCA, 2025 (exploration of automated linguistic assessment)

5. Lüdtke, U., Bornman, J., de Wet, F., Heid, U., Ostermann, J., Rumberg, L., van der Linde, J., & Ehlert, H. (2023). Multi-disciplinary perspectives on automatic analysis of children's language samples: Where do we go from here? Folia Phoniatirca et Logopaedica, 75, 1-12. (Interdiscuplinary, international review on child speech technology with the joint German and South African team)

Needs

Personnel

CONTACT

International Research Centre
on Artificial Intelligence (IRCAI)
under the auspices of UNESCO 

Jožef Stefan Institute
Jamova cesta 39
SI-1000 Ljubljana

info@ircai.org
ircai.org

FOLLOW US

The designations employed and the presentation of material throughout this website do not imply the expression of any opinion whatsoever on the part of UNESCO concerning the legal status of any country, territory, city or area of its authorities, or concerning the delimitation of its frontiers or boundaries.

PRIVACY  POLICY