LiamKhoaLe committed on
Commit d555d15 · 1 Parent(s): f89165d

Upd ASR and TTS in README

Files changed (1)
  1. README.md +7 -0
README.md CHANGED
@@ -50,6 +50,11 @@ tags:
  - Responses translated back to original language
  - Powered by DeepSeek-R1-8B for translation

+ ### 🎤 **Voice Features**
+ - **Speech-to-Text**: Microphone icon for voice input transcription using OpenAI Whisper Large-v3-Turbo
+ - **Text-to-Speech**: Speaker icon in responses to generate voice output using Maya1 TTS model
+ - Both models preloaded on startup for instant voice interactions
+
  ### ⚙️ **Advanced Configuration**
  - Customizable generation parameters (temperature, top-p, top-k)
  - Adjustable retrieval settings (top-k, merge threshold)
@@ -74,6 +79,8 @@ tags:

  - **Medical Models**: MedSwin/MedSwin-7B-SFT, MedSwin-7B-KD, MedSwin-Merged-TA-SFT-0.7
  - **Translation Model**: deepseek-ai/DeepSeek-R1-0528-Qwen3-8B
+ - **Speech-to-Text**: openai/whisper-large-v3-turbo
+ - **Text-to-Speech**: maya-research/maya1
  - **Embedding Model**: abhinand/MedEmbed-large-v0.1 (domain-tuned medical embeddings)
  - **RAG Framework**: LlamaIndex with hierarchical node parsing
  - **Web Search**: DuckDuckGo with content extraction and summarization