App Submission: Kokoro (#2498)

2025-05-12 19:30:42 +02:00 · 2025-04-18 11:17:56 +02:00 · ee1d5b5442
commit ee1d5b5442
parent 0914efd7d6
2 changed files with 45 additions and 0 deletions
--- a/kokoro/docker-compose.yml
+++ b/kokoro/docker-compose.yml
@@ -0,0 +1,12 @@
+version: '3.7'
+
+services:
+  app_proxy:
+    environment:
+      APP_HOST: kokoro_web_1
+      APP_PORT: 8880
+
+  web:
+    image: ghcr.io/remsky/kokoro-fastapi-cpu:v0.2.3pre@sha256:3992367435b059cab86f3417b094f32f352e201434af446024282364bdb0eca3
+    user: "1000:1000"
+    restart: on-failure
--- a/kokoro/umbrel-app.yml
+++ b/kokoro/umbrel-app.yml
@@ -0,0 +1,33 @@
+manifestVersion: 1
+id: kokoro
+name: Kokoro
+tagline: An intelligent, high-quality TTS solution
+category: ai
+version: "0.2.2"
+port: 8877
+description: >-
+  Kokoro is a Text-to-Speech (TTS) model that delivers high speech quality with only 82 million parameters, which makes it competitive with much larger, more resource-intensive models. Despite its compact architecture, Kokoro turns text into clear, natural-sounding speech and suits applications that rely on speech synthesis. It is designed for efficiency and fast processing, in both resource-constrained environments and production systems. Compared with traditional TTS models, which often require substantial computational resources, Kokoro is a cheaper and faster alternative that does not compromise output quality.
+
+
+  Its lightweight architecture ensures that Kokoro can be deployed even on less powerful devices, making it easier to integrate into various applications. Developers can use Kokoro in a wide range of projects, whether for virtual assistants, interactive systems, or enhancing accessibility. The model not only provides accurate and intelligible speech, but also introduces emotional nuances that enhance the user experience. With its flexibility and ability to be applied across diverse scenarios, Kokoro is a valuable resource for anyone seeking an efficient, lightweight, and powerful speech synthesis solution in their projects.
+
+
+  ⚠️ This app only works in Chromium-based browsers (e.g., Chrome, Edge, Brave) and is available at "umbrel.local:8877/web/". Please note that the app is approximately 4GB in size, so the loading process may take a few moments.
+
+
+  ⚙️ The API is available at "umbrel.local:8877", and the API documentation can be found at "umbrel.local:8877/docs".
+developer: Hexgrad
+website: https://kokorotts.net/
+submitter: dennysubke
+submission: https://github.com/getumbrel/umbrel-apps/pull/2498
+repo: https://github.com/remsky/Kokoro-FastAPI
+support: https://github.com/remsky/Kokoro-FastAPI/issues
+gallery:
+  - 1.jpg
+  - 2.jpg
+  - 3.jpg
+releaseNotes: ""
+dependencies: []
+path: "/web"
+defaultUsername: ""
+defaultPassword: ""
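
Usage note (not part of the commit): the description says the API is served at "umbrel.local:8877" with documentation under "/docs". The sketch below shows how a client might request speech from it. It assumes the server exposes an OpenAI-style `/v1/audio/speech` route and accepts the `model`, `input`, `voice`, and `response_format` fields; the voice name `af_bella` and the helper names `build_speech_request` and `synthesize` are illustrative, so confirm the actual route and fields on the "/docs" page first.

```python
import json
import urllib.request

# Base URL taken from the app description; adjust it for your own host.
BASE_URL = "http://umbrel.local:8877"


def build_speech_request(text, voice="af_bella", base_url=BASE_URL):
    """Build (but do not send) a POST request for speech synthesis.

    Assumption: the route and the payload field names follow the
    OpenAI-style speech endpoint; verify them against /docs.
    """
    payload = {
        "model": "kokoro",
        "input": text,
        "voice": voice,  # hypothetical voice name
        "response_format": "mp3",
    }
    return urllib.request.Request(
        url=f"{base_url}/v1/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def synthesize(text, out_path="speech.mp3"):
    """Send the request and write the returned audio bytes to out_path."""
    with urllib.request.urlopen(build_speech_request(text)) as resp:
        audio = resp.read()
    with open(out_path, "wb") as f:
        f.write(audio)
    return out_path
```

Calling `synthesize("Hello from Umbrel")` on a machine that can reach the app should write an audio file to `speech.mp3`, provided the assumed route matches the running server. Building the request separately from sending it keeps the payload easy to inspect before any network traffic happens.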