CMU Sphinx Using Python

MultiBench: Multiscale Benchmarks for Multimodal Representation Learning

Paul Pu Liang, Yiwei Lyu, Xiang Fan, Zetian Wu, Yun Cheng, Jason Wu, Leslie Chen, Peter Wu, Michelle A. Lee, Yuke Zhu, Ruslan Salakhutdinov, Louis-Philippe Morency NeurIPS 2021 Datasets and Benchmarks ...

GitHub

A collection of links and notes on forced alignment tools

Given an audio file containing speech, and the corresponding transcript, computing a forced alignment is the process of determining, for each fragment of the transcript, the time interval (in the ...

Analytics Insight

Top 10 Open Source Python Libraries for Voice Agents in 2025

Open source Python libraries empower developers to build advanced, customizable voice agents with full transparency. Python libraries like Whisper, Rasa, and Transformers lead the 2025 voice ...

Interactive VR Using RAG in Education

In the realm of education, the fusion of Artificial Intelligence (AI) and Virtual Reality (VR) technologies is paving the way for innovative teaching methods and immersive learning experiences.

Frontiers

Computational Sociolinguistics

Over the past decade, a new approach to the study of language variation and change has emerged at the intersection of linguistics and computer science, opening up new ground for research on one of the ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results