Dataset Multiple Languages

1don MSN

MLCommons and Hugging Face team up to release massive speech dataset for AI research

The nonprofit AI safety org MLCommons has teamed up with Hugging Face to release a public domain dataset of speech recordings ...

InfoQ5d

Synthetic Data Generator Simplifies Dataset Creation with Large Language Models

Hugging Face has introduced the Synthetic Data Generator, a new tool leveraging Large Language Models (LLMs), that offers a streamlined, no-code approach to creating custom datasets. The tool ...

Singularity Hub17d

Meta’s New AI Translates Speech in Real Time Across More Than 100 Languages

The translation part of the AI was pre-trained on a massive dataset containing 4.5 million hours of spoken audio in multiple languages. This initial step helped the AI “learn patterns in the data, ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Trending now