App Submission: Kokoro (#2498)

This commit is contained in:
Denny 2025-04-18 11:17:56 +02:00 committed by GitHub
parent 0914efd7d6
commit ee1d5b5442
No known key found for this signature in database
GPG Key ID: B5690EEEBB952194
2 changed files with 45 additions and 0 deletions

12
kokoro/docker-compose.yml Normal file
View File

@ -0,0 +1,12 @@
version: '3.7'
services:
app_proxy:
environment:
APP_HOST: kokoro_web_1
APP_PORT: 8880
web:
image: ghcr.io/remsky/kokoro-fastapi-cpu:v0.2.3pre@sha256:3992367435b059cab86f3417b094f32f352e201434af446024282364bdb0eca3
user: "1000:1000"
restart: on-failure

33
kokoro/umbrel-app.yml Normal file
View File

@ -0,0 +1,33 @@
manifestVersion: 1
id: kokoro
name: Kokoro
tagline: An intelligent, high-quality TTS solution
category: ai
version: "0.2.2"
port: 8877
description: >-
Kokoro is an advanced Text-to-Speech (TTS) model that delivers impressive speech quality with only 82 million parameters, making it competitive with much larger and more resource-intensive models. Despite its relatively compact architecture, Kokoro effectively transforms text into clear, natural-sounding speech, making it an excellent choice for applications relying on speech synthesis. The model has been specifically designed to ensure high efficiency and fast processing, making it suitable for both resource-constrained environments and production systems. In comparison to traditional TTS models, which often require substantial computational resources, Kokoro offers a more cost-effective and faster alternative without compromising the quality of speech output.
Its lightweight architecture ensures that Kokoro can be deployed even on less powerful devices, making it easier to integrate into various applications. Developers can use Kokoro in a wide range of projects, whether for virtual assistants, interactive systems, or enhancing accessibility. The model not only provides accurate and intelligible speech, but also introduces emotional nuances that enhance the user experience. With its flexibility and ability to be applied across diverse scenarios, Kokoro is a valuable resource for anyone seeking an efficient, lightweight, and powerful speech synthesis solution in their projects.
⚠️ This app only works in Chromium-based browsers (e.g., Chrome, Edge, Brave) and is available at "umbrel.local:8877/web/". Please note that the app is approximately 4GB in size, so the loading process may take a few moments.
⚙️ The API is available at "umbrel.local:8877", and the API documentation can be found at "umbrel.local:8877/docs".
developer: Hexgrad
website: https://kokorotts.net/
submitter: dennysubke
submission: https://github.com/getumbrel/umbrel-apps/pull/2498
repo: https://github.com/remsky/Kokoro-FastAPI
support: https://github.com/remsky/Kokoro-FastAPI/issues
gallery:
- 1.jpg
- 2.jpg
- 3.jpg
releaseNotes: ""
dependencies: []
path: "/web"
defaultUsername: ""
defaultPassword: ""