Discover more from Data Machina
Data Machina #249
GenAI Music. MusicGen. MusicFX. Stable Audio 2. Suno V3. Udio. Rerank3 Model. Parler TTS. nanoLLaVA VL Model. Text2SQL DuckDB-NSQL-7B. aiXcoder-7B
Generative AI Music. In the last year or so, Generative AI Music has improved massively. Although early days, today you can generate some pretty decent, short duration music of all kinds with AI. If you like creating music and AI, here is a list of interesting Generative AI music stuff.
Facebook AIR MusicGen. Probably one of the pioneering models in AI quality music generation. MusicGen has sparked a whole universe of MusicGen derivative models of all kinds, and it’s the model behind many musicgen apps. The model is based on a single stage auto-regressive Transformer model, and unlike Google LM, MusicGen doesn't require a self-supervised semantic representation. Repo and demos here: MusicGen: Simple and Controllable AI Music Generation
Mulbert. One of the early AI musicgen startups, Mulbert is an app for generating high-quality, royalty-free music with AI. Thy this Mulbert text-to-music notebook and get the app here.
Stable Audio 2.0. Recently introduced by Stability AI, Stable Audio 2.0 lets you generate high-quality, full tracks from text & audio with coherent musical structure up to three minutes in length at 44.1kHz stereo. Checkout the blogpost, demos, trial: Introducing Stable Audio 2.0
MusicLang is an app for controllable music generation with AI, mostly oriented to artists and music producers. The MusicLang team recently released MusicLang Predict, your controllable music copilot (repo). You can Try MusicLang here.
The MusicLang Tokeniser. An interesting post explaining how tokenization works inside MusicLang and its capacity to afford users profound control over the musical content generated by transformer models. The MusicLang tokenizer : Toward controllable symbolic music generation.
Glycol. A foundation for some specialised musicgen model. If you love coding and music this is pretty cool. Glycol is an open source, next-gen language for generating music with code. Get Glycol from this repo.
RaveForce Agent is a Python package under MIT license that allows you to define your musical tasks in Python with Glicol syntax, and train an agent to do the task with APIs similar to the OpenAI Gym. Get ReveForce here.
Google MusicFX is powered by Google MusicLM and AudioLM. Simple with no frills but good quality. A neat feature is DJ Mode, that enables you to generate a real-time stream of music by adding and adjusting musical prompts to evolve the music live. You can try Google MusicFX here
Suno AI V3 The latest version of Suno enables you to generate two-minute, radio-quality music from text prompts in just a few seconds. The model behind Suno combines a proprietary AI musicgen model and ChatGPT for the lyrics. Sun has some cool features and you can get some decent outputs. Try Suno.ai V3 here
Udio. Recently released, it’s super trending in the musicgen scene now. Suno enables you to create music from simple text prompts by specifying topics, genres, and other descriptors which are then transformed into professional quality tracks. Udio it’s pretty impressive and you can generate some amazing music output. I really like it! Try Udio here. You can also watch this tutorial on how to use Audio.
.Have a nice week.
10 Link-o-Troned
the ML Pythonista
Deep & Other Learning Bits
AI/ DL ResearchDocs
MLOps Untangled
ML Datasets & Stuff
Postscript, etc
Tips? Suggestions? Feedback? email Carlos
Curated by @ds_ldn in the middle of the night.
Subscribe to Data Machina
A weekly deep dive into the latest AI / ML research, projects & repos.