Skip to content

Commit b9fae43

Browse files
committed
feat: add gemini-2.0 support!
1 parent 6bd0a0f commit b9fae43

File tree

6 files changed

+15
-18
lines changed

6 files changed

+15
-18
lines changed

README.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -110,8 +110,8 @@ docker run -d -p 8501:8501 --gpus all videolingo
110110

111111
## API
112112
VideoLingo supports OpenAI-Like API format and various dubbing interfaces:
113-
- `claude-3-5-sonnet-20240620`, `gemini-1.5-pro-002`, `gpt-4o`, `deepseek-coder`, `Qwen2.5-72B-Instruct`, ... (sorted by performance)
114-
- `azure-tts`, `openai-tts`, `siliconflow-fishtts`, `fish-tts`, `GPT-SoVITS`, `edge-tts`, `custom-tts`(edit yourself in custom_tts.py)
113+
- `claude-3-5-sonnet-20240620`, **`gemini-2.0-flash-exp`**, `gpt-4o`, `deepseek-coder`, ... (sorted by performance)
114+
- `azure-tts`, `openai-tts`, `siliconflow-fishtts`, **`fish-tts`**, `GPT-SoVITS`, `edge-tts`, `*custom-tts`(ask gpt to help you define in custom_tts.py)
115115

116116
> **Note:** VideoLingo is now integrated with [302.ai](https://gpt302.saaslink.net/C2oHR9), **one API KEY** for both LLM and TTS! Also supports fully local deployment using Ollama for LLM and Edge-TTS for dubbing, no cloud API required!
117117

config.yaml

Lines changed: 3 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -1,11 +1,11 @@
11
# * Settings marked with * are advanced settings that won't appear in the Streamlit page and can only be modified manually in config.py
2-
version: "2.0.4"
2+
version: "2.1.2"
33
## ======================== Basic Settings ======================== ##
44
# API settings
55
api:
66
key: 'YOUR_API_KEY'
77
base_url: 'https://api.302.ai'
8-
model: 'claude-3-5-sonnet-20240620'
8+
model: 'gemini-2.0-flash-exp'
99

1010
# Language settings, written into the prompt, can be described in natural language
1111
target_language: '简体中文'
@@ -133,9 +133,7 @@ allowed_audio_formats:
133133
llm_support_json:
134134
- 'gpt-4o'
135135
- 'gpt-4o-mini'
136-
- 'gemini-1.5-flash-latest'
137-
- 'gemini-1.5-pro-latest'
138-
- 'gemini-1.5-pro-002'
136+
- 'gemini-2.0-flash-exp'
139137
- 'deepseek-coder'
140138

141139
# have problems

docs/pages/docs/start.en-US.md

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,8 @@ This project requires Large Language Models and TTS. For best quality, please us
77

88
| Recommended Model | Recommended Provider | base_url | Price | Effect |
99
|:-----|:---------|:---------|:-----|:---------|
10-
| claude-3-5-sonnet-20240620 | [302AI](https://gpt302.saaslink.net/C2oHR9) | https://api.302.ai | $7.5 / 1M tokens | 🤩 |
10+
| gemini-2.0-flash-exp | [302AI](https://gpt302.saaslink.net/C2oHR9) | https://api.302.ai | $0.3 / 1M tokens | 🥳 |
11+
| claude-3-5-sonnet-20240620 | [302AI](https://gpt302.saaslink.net/C2oHR9) | https://api.302.ai | $15 / 1M tokens | 🤩 |
1112
| deepseek-coder | [302AI](https://gpt302.saaslink.net/C2oHR9) | https://api.302.ai | ¥2 / 1M tokens | 😃 |
1213
| qwen2.5-coder:32b | [Ollama](https://ollama.ai) | http://localhost:11434 | Local | 😃 |
1314

@@ -20,7 +21,7 @@ VideoLingo provides multiple TTS integration methods. Here's a comparison (skip
2021
|:---------|:---------|:-----|:-----|:---------|:-----------|
2122
| 🔊 Azure TTS ⭐ | [302AI](https://gpt302.saaslink.net/C2oHR9) | Natural effect | Limited emotions | 🤩 | 😃 |
2223
| 🎙️ OpenAI TTS | [302AI](https://gpt302.saaslink.net/C2oHR9) | Realistic emotions | Chinese sounds foreign | 😕 | 🤩 |
23-
| 🎤 Fish TTS | [302AI](https://gpt302.saaslink.net/C2oHR9) | Authentic native | Limited official models | 😂 | 😂 |
24+
| 🎤 Fish TTS | [302AI](https://gpt302.saaslink.net/C2oHR9) | Authentic native | Limited official models | 🤩 | 😂 |
2425
| 🎙️ SiliconFlow FishTTS | [SiliconFlow](https://cloud.siliconflow.cn/i/ttKDEsxE) | Voice Clone | Unstable cloning effect | 😃 | 😃 |
2526
| 🗣 Edge TTS | Local | Completely free | Average effect | 😐 | 😐 |
2627
| 🗣️ GPT-SoVITS | Local | Best voice cloning | Only supports Chinese/English, requires local inference, complex setup | 🏆 | 🚫 |

docs/pages/docs/start.zh-CN.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,8 @@
77

88
| 推荐模型 | 推荐提供商 | base_url | 价格 | 效果 |
99
|:-----|:---------|:---------|:-----|:---------|
10-
| claude-3-5-sonnet-20240620 | [302AI](https://gpt302.saaslink.net/C2oHR9) | https://api.302.ai | $7.5 / 1M tokens | 🤩 |
10+
| gemini-2.0-flash-exp | [302AI](https://gpt302.saaslink.net/C2oHR9) | https://api.302.ai | $0.3 / 1M tokens | 🥳 |
11+
| claude-3-5-sonnet-20240620 | [302AI](https://gpt302.saaslink.net/C2oHR9) | https://api.302.ai | $15 / 1M tokens | 🤩 |
1112
| deepseek-coder | [302AI](https://gpt302.saaslink.net/C2oHR9) | https://api.302.ai | ¥2 / 1M tokens | 😃 |
1213
| qwen2.5-coder:32b | [Ollama](https://ollama.ai) | http://localhost:11434 | 本地 | 😃 |
1314

i18n/README.zh.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -112,8 +112,8 @@ docker run -d -p 8501:8501 --gpus all videolingo
112112

113113
## API
114114
本项目支持 OpenAI-Like 格式的 api 和多种配音接口:
115-
- `claude-3-5-sonnet-20240620`, `gemini-1.5-pro-002`, `gpt-4o`, `deepseek-coder`, `Qwen2.5-72B-Instruct`, ...(按效果排序)
116-
- `azure-tts`, `openai-tts`, `siliconflow-fishtts`, `fish-tts`, `GPT-SoVITS`, `edge-tts`, `custom-tts`(可在custom_tts.py中自行编辑)
115+
- `claude-3-5-sonnet-20240620`, **`gemini-2.0-flash-exp`**, `gpt-4o`, `deepseek-coder`, ...(按效果排序)
116+
- `azure-tts`, `openai-tts`, `siliconflow-fishtts`, **`fish-tts`**, `GPT-SoVITS`, `edge-tts`, `*custom-tts`(ask gpt to help you define in custom_tts.py)
117117

118118
> **注意:** VideoLingo 现已与 [302.ai](https://gpt302.saaslink.net/C2oHR9) 集成,**一个 API KEY** 即可同时支持 LLM 和 TTS!同时也支持完全本地部署,使用 Ollama 作为 LLM 和 Edge-TTS 作为配音,无需云端 API!
119119

i18n/中文/config.yaml

Lines changed: 3 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -1,11 +1,11 @@
11
# * 标有 * 的设置是高级设置,不会出现在 Streamlit 页面中,只能在 config.py 中手动修改
2-
version: "2.0.4"
2+
version: "2.1.2"
33
## ======================== 基本设置 ======================== ##
44
# API 设置
55
api:
66
key: 'YOUR_API_KEY'
77
base_url: 'https://api.302.ai'
8-
model: 'claude-3-5-sonnet-20240620'
8+
model: 'gemini-2.0-flash-exp'
99

1010
# 语言设置,写入提示词,可以用自然语言描述
1111
target_language: '简体中文'
@@ -133,10 +133,7 @@ allowed_audio_formats:
133133
llm_support_json:
134134
- 'gpt-4o'
135135
- 'gpt-4o-mini'
136-
- 'grok-beta'
137-
- 'gemini-1.5-flash-latest'
138-
- 'gemini-1.5-pro-latest'
139-
- 'gemini-1.5-pro-002'
136+
- 'gemini-2.0-flash-exp'
140137
- 'deepseek-coder'
141138

142139
# 存在问题

0 commit comments

Comments
 (0)