Spaces:
Running
on
Zero
Running
on
Zero
Commit
·
d555d15
1
Parent(s):
f89165d
Upd ASR and TTS in README
Browse files
README.md
CHANGED
|
@@ -50,6 +50,11 @@ tags:
|
|
| 50 |
- Responses translated back to original language
|
| 51 |
- Powered by DeepSeek-R1-8B for translation
|
| 52 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 53 |
### ⚙️ **Advanced Configuration**
|
| 54 |
- Customizable generation parameters (temperature, top-p, top-k)
|
| 55 |
- Adjustable retrieval settings (top-k, merge threshold)
|
|
@@ -74,6 +79,8 @@ tags:
|
|
| 74 |
|
| 75 |
- **Medical Models**: MedSwin/MedSwin-7B-SFT, MedSwin-7B-KD, MedSwin-Merged-TA-SFT-0.7
|
| 76 |
- **Translation Model**: deepseek-ai/DeepSeek-R1-0528-Qwen3-8B
|
|
|
|
|
|
|
| 77 |
- **Embedding Model**: abhinand/MedEmbed-large-v0.1 (domain-tuned medical embeddings)
|
| 78 |
- **RAG Framework**: LlamaIndex with hierarchical node parsing
|
| 79 |
- **Web Search**: DuckDuckGo with content extraction and summarization
|
|
|
|
| 50 |
- Responses translated back to original language
|
| 51 |
- Powered by DeepSeek-R1-8B for translation
|
| 52 |
|
| 53 |
+
### 🎤 **Voice Features**
|
| 54 |
+
- **Speech-to-Text**: Microphone icon for voice input transcription using OpenAI Whisper Large-v3-Turbo
|
| 55 |
+
- **Text-to-Speech**: Speaker icon in responses to generate voice output using Maya1 TTS model
|
| 56 |
+
- Both models preloaded on startup for instant voice interactions
|
| 57 |
+
|
| 58 |
### ⚙️ **Advanced Configuration**
|
| 59 |
- Customizable generation parameters (temperature, top-p, top-k)
|
| 60 |
- Adjustable retrieval settings (top-k, merge threshold)
|
|
|
|
| 79 |
|
| 80 |
- **Medical Models**: MedSwin/MedSwin-7B-SFT, MedSwin-7B-KD, MedSwin-Merged-TA-SFT-0.7
|
| 81 |
- **Translation Model**: deepseek-ai/DeepSeek-R1-0528-Qwen3-8B
|
| 82 |
+
- **Speech-to-Text**: openai/whisper-large-v3-turbo
|
| 83 |
+
- **Text-to-Speech**: maya-research/maya1
|
| 84 |
- **Embedding Model**: abhinand/MedEmbed-large-v0.1 (domain-tuned medical embeddings)
|
| 85 |
- **RAG Framework**: LlamaIndex with hierarchical node parsing
|
| 86 |
- **Web Search**: DuckDuckGo with content extraction and summarization
|