CodeVoice Documentation

Overview

Introduction

CodeVoice is a text-to-speech platform for generating natural voice output through a clean dashboard and API workflow. It combines an interactive UI for testing and account management with programmatic access for application integration.

The platform is built around three core capabilities: generating speech from text, discovering available voices and languages, and tracking account-level credit consumption. That makes it suitable both for fast prototyping and for embedding speech generation into production products.

Typical use cases: product narration, internal automation, accessibility features, scripted content generation, and customer-facing voice workflows.

Quickstart

Getting Started

Use the following sequence to go from account creation to your first generated audio response.

1

Create an account

Sign up through the CodeVoice dashboard so you can access your workspace, credits, and API key management area.

2

Get your API key

Open the API and Credits section to find the personal key tied to your account. This key is required for authenticated requests.

3

Open the TTS Playground

Use the dashboard playground to choose a voice, enter text, and preview the speech generation workflow before integrating it into your app.

4

Generate your first speech request

Send a request with text, voice, and language metadata. Once the request succeeds, use the returned audio URL in your frontend or pipeline.

Security

Authentication

All authenticated API requests should include your API key in the request headers. Treat this key like a secret and never expose it in public client code unless your architecture explicitly allows it.

X-API-Key: YOUR_API_KEY
Authorization: Bearer YOUR_JWT_TOKEN

Developer API calls usually use X-API-Key. Logged-in dashboard users can also call protected routes with a JWT bearer token from /api/auth/login.

For frontend integrations, prefer routing requests through your own secure backend or edge layer when you need stronger control over exposure, rate limiting, or user-level enforcement.

API Reference

Text to Speech API

The primary speech endpoint is POST /v1/tts. It queues a generation job and returns a job_id that you then poll through the status endpoint until the audio file is ready.

Queue a job

POST /v1/tts
Content-Type: application/json
X-API-Key: YOUR_API_KEY

{
  "text": "Hello world from CodeVoice",
  "voice": "ryan"
}

Queue response

{
  "job_id": "550e8400-e29b-41d4-a716-446655440000",
  "status": "queued",
  "user_type": "registered",
  "tier": 1,
  "message": "Free credit used. 59 remaining."
}

curl -X POST http://127.0.0.1:8000/v1/tts \
  -H "Content-Type: application/json" \
  -H "X-API-Key: YOUR_API_KEY" \
  -d "{\"text\":\"Hello world from CodeVoice\",\"voice\":\"ryan\"}"

Field	Type	Description
`text`	string	The text content to synthesize into speech.
`voice`	string	The selected voice identifier from the voices catalog.
`language`	string	Not required for `/v1/tts`. The current request model expects `text`, `voice`, and optional `api_key` in the body.

Workflow

Speech Job Flow

CodeVoice TTS is asynchronous. After queueing a job, poll the status endpoint until the job is completed, then download or stream the audio from the returned path.

1

Queue speech generation

Send POST /v1/tts and capture the returned job_id.

2

Poll job status

Call GET /tts/status/{job_id} until the response changes to completed.

3

Fetch audio

Use the returned audio_url, usually /tts/audio/{job_id}, to load or download the WAV file.

GET /tts/status/550e8400-e29b-41d4-a716-446655440000

{
  "status": "completed",
  "audio_url": "/tts/audio/550e8400-e29b-41d4-a716-446655440000"
}

curl -L http://127.0.0.1:8000/tts/audio/550e8400-e29b-41d4-a716-446655440000 --output voice.wav

Catalog

Voices API

The voices endpoint is used to discover the list of voices available to your account and identify which voices support the languages or quality levels your product requires.

Voice list

Retrieve all available voices and surface them in a selector, settings panel, or onboarding workflow.

Language support

Map voices to supported language labels and group options by region or voice style when rendering in the UI.

Recommended pattern: cache the voice list in your frontend state and provide filters for language, gender, or quality to keep selection fast for end users.

GET /v1/voices

{
  "voices": [
    {
      "name": "ryan",
      "gender": "male",
      "language": "en-US",
      "language_name": "English",
      "description": "Voice: ryan",
      "quality": "high"
    }
  ]
}

Speech to Text

Speech to Text API

CodeVoice also exposes STT utilities for language discovery, uploaded file transcription, and live-browser completion accounting.

Endpoint	Method	Purpose
`/api/stt/languages`	GET	Returns supported STT language options for the UI or your own client.
`/transcribe`	POST	Uploads an audio file and returns a transcription response.
`/api/stt/live/complete`	POST	Deducts one credit after a successful live-browser transcript is completed.

curl -X POST http://127.0.0.1:8000/transcribe \
  -H "X-API-Key: YOUR_API_KEY" \
  -F "language=en-US" \
  -F "file=@sample.wav"

Accounts

Account API

The dashboard and auth experience use JWT-based account endpoints. These are useful when you want your product to create accounts, log users in, and surface dashboard usage information in-app.

Endpoint	Method	Description
`/api/auth/register`	POST	Create a new user account and return a JWT token plus API key.
`/api/auth/login`	POST	Log in an existing user and return a JWT token plus API key.
`/api/auth/me`	GET	Return the current authenticated user and credit summary.
`/api/user/dashboard`	GET	Return dashboard-ready user, credit, and usage data.

POST /api/auth/login
Content-Type: application/json

{
  "email": "user@example.com",
  "password": "your-password"
}

Usage

Credits System

CodeVoice uses a credit-based model to track usage. For registered users, one successful generation or billed STT completion consumes one credit. The dashboard separates free monthly credits from paid credits so you can see how usage is allocated over time.

1 request equals 1 credit.
Registered users receive monthly free credits that reset each month.
Paid credits are used when free monthly allocation is exhausted.
API-only users can be configured by admins with monthly limits, extra credits, or unlimited mode.
Usage statistics in the dashboard help you monitor requests, success rate, and recent activity.

Credit Type	Behavior	Use Case
Free credits	Reset monthly according to plan limits	Exploration, testing, low-volume usage
Paid credits	Used for extended capacity after free usage	Production workloads, higher-volume generation
API-only monthly credits	Admin-managed monthly included balance, optionally unlimited	Developer integrations, partner access, managed API accounts

Reliability

Error Handling

When a request cannot be completed, the API returns an error response that should be surfaced clearly in your application. Build your integration so it distinguishes between auth failures, validation problems, and account usage issues.

{
  "detail": {
    "error": "insufficient_credits",
    "message": "Monthly API credit limit reached. Please contact admin or enable unlimited access.",
    "user_type": "api_only"
  }
}

Recommended handling

Show user-friendly messages, preserve the original request input, and expose retry options where appropriate.

Useful safeguards

Log request failures, set client-side timeouts, and track error frequency to identify spikes or integration regressions.

Integration

SDK / Integration

You can integrate CodeVoice in any frontend or backend stack using standard HTTP requests. The examples below show the recommended server-side proxy pattern, a direct backend Python client, and a frontend polling flow for completed TTS jobs.

Node / backend proxy

app.post("/api/tts", async (req, res) => {
  const response = await fetch("http://127.0.0.1:8000/v1/tts", {
    method: "POST",
    headers: {
      "Content-Type": "application/json",
      "X-API-Key": process.env.CODEVOICE_API_KEY
    },
    body: JSON.stringify({
      text: req.body.text,
      voice: req.body.voice || "ryan"
    })
  });

  const data = await response.json();
  res.status(response.status).json(data);
});

Frontend polling example

const queued = await fetch("/api/tts", {
  method: "POST",
  headers: {
    "Content-Type": "application/json"
  },
  body: JSON.stringify({
    text: "Hello from the browser",
    voice: "ryan"
  })
}).then(res => res.json());

const poll = async (jobId) => {
  while (true) {
    const status = await fetch(`/tts/status/${jobId}`).then(res => res.json());
    if (status.status === "completed") return status.audio_url;
    if (status.status === "failed") throw new Error(status.error || "TTS failed");
    await new Promise(resolve => setTimeout(resolve, 1500));
  }
};

Python integration

import requests
import time

BASE_URL = "http://127.0.0.1:8000"
HEADERS = {"X-API-Key": "YOUR_API_KEY"}

job = requests.post(
    f"{BASE_URL}/v1/tts",
    headers={**HEADERS, "Content-Type": "application/json"},
    json={"text": "Hello from Python", "voice": "ryan"},
).json()

job_id = job["job_id"]
while True:
    status = requests.get(f"{BASE_URL}/tts/status/{job_id}").json()
    if status["status"] == "completed":
        audio = requests.get(f"{BASE_URL}{status['audio_url']}")
        open("voice.wav", "wb").write(audio.content)
        break
    if status["status"] == "failed":
        raise RuntimeError(status.get("error", "TTS failed"))
    time.sleep(1.5)

For larger apps, wrap request calls in a dedicated API client layer so auth headers, retries, and response parsing stay consistent across your codebase.

Recommended architecture: keep your CodeVoice API key on your own backend or edge service, expose only your own product endpoints to browsers, and use JWT-based dashboard routes only for signed-in end users.

Support

FAQ

How do credits work?

Each generation request consumes one credit. Free credits reset monthly, while paid credits extend usage when your monthly limit is exceeded.

Which languages are supported?

Supported languages depend on the voice catalog available to your account. Use the voices endpoint and dashboard filters to inspect the available language combinations.

What audio formats can I expect?

The platform returns generated audio via a URL response. Your implementation should use the returned file metadata and playback behavior supported by your environment.

Build production-ready voice experiences with CodeVoice

What CodeVoice does

Who it is for

Introduction

Getting Started

Create an account

Get your API key

Open the TTS Playground

Generate your first speech request

Authentication

Text to Speech API

Queue a job

Queue response

Speech Job Flow

Queue speech generation

Poll job status

Fetch audio

Voices API

Voice list

Language support

Speech to Text API

Account API

Credits System

Error Handling

Recommended handling

Useful safeguards

SDK / Integration

Node / backend proxy

Frontend polling example

Python integration

FAQ

How do credits work?

Which languages are supported?

What audio formats can I expect?