ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models Paper โข 2512.07843 โข Published 16 days ago โข 12
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution Paper โข 2510.08697 โข Published Oct 9 โข 36
ISDrama: Immersive Spatial Drama Generation through Multimodal Prompting Paper โข 2504.20630 โข Published Apr 29 โข 9
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head Paper โข 2304.12995 โข Published Apr 25, 2023
Singing Voice Data Scaling-up: An Introduction to ACE-Opencpop and KiSing-v2 Paper โข 2401.17619 โข Published Jan 31, 2024 โข 1
SingMOS: An extensive Open-Source Singing Voice Dataset for MOS Prediction Paper โข 2406.10911 โข Published Jun 16, 2024
Muskits-ESPnet: A Comprehensive Toolkit for Singing Voice Synthesis in New Paradigm Paper โข 2409.07226 โข Published Sep 11, 2024 โข 1
WritingBench: A Comprehensive Benchmark for Generative Writing Paper โข 2503.05244 โข Published Mar 7 โข 20
view post Post 46372 Google drops Gemini 2.0 Flash Thinkinga new experimental model that unlocks stronger reasoning capabilities and shows its thoughts. The model plans (with thoughts visible), can solve complex problems with Flash speeds, and morenow available in anychat, try it out: https://huggingface.co/spaces/akhaliq/anychat See translation 5 replies ยท ๐ 12 12 ๐ฅ 6 6 ๐ 3 3 ๐ 2 2 + Reply
CosyVoice 2: Scalable Streaming Speech Synthesis with Large Language Models Paper โข 2412.10117 โข Published Dec 13, 2024 โข 3
view post Post 45532 QwQ-32B-Preview is now available in anychatA reasoning model that is competitive with OpenAI o1-mini and o1-previewtry it out: https://huggingface.co/spaces/akhaliq/anychat See translation 1 reply ยท โค๏ธ 3 3 ๐ 2 2 + Reply
view post Post 5023 New model drop in anychatallenai/Llama-3.1-Tulu-3-8B is now availabletry it here: https://huggingface.co/spaces/akhaliq/anychat See translation ๐ฅ 3 3 ๐ 1 1 + Reply
view post Post 3790 anychatsupports chatgpt, gemini, perplexity, claude, meta llama, grok all in one apptry it out there: https://huggingface.co/spaces/akhaliq/anychat โค๏ธ 7 7 ๐ 4 4 ๐ฅ 2 2 + Reply
ESPnet-EZ: Python-only ESPnet for Easy Fine-tuning and Integration Paper โข 2409.09506 โข Published Sep 14, 2024 โข 4
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling Paper โข 2408.16532 โข Published Aug 29, 2024 โข 50
MulliVC: Multi-lingual Voice Conversion With Cycle Consistency Paper โข 2408.04708 โข Published Aug 8, 2024 โข 9
MulliVC: Multi-lingual Voice Conversion With Cycle Consistency Paper โข 2408.04708 โข Published Aug 8, 2024 โข 9