Chat - AI Voice Chat with 3D Avatars

A real-time voice chat application featuring 3D animated avatars powered by AI and high-quality text-to-speech synthesis.

Features

🎭 3D Animated Avatars

Interactive 3D avatars (Julia & David) using the TalkingHead library
Real-time lip sync driven by the synthesized speech
Smooth animations and natural facial expressions
Avatar voice customization with 50+ Kokoro TTS voices

🗣️ Voice Chat

Voice Input: Browser-based speech recognition with 1-second auto-submit
Text Input: Traditional text chat with Enter-to-submit
AI Responses: Streaming AI chat powered by local Ollama LLMs
Voice Output: High-quality Kokoro TTS with optimized server-side synthesis
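The 1-second auto-submit for voice input can be pictured as a silence timer that restarts on every recognition result. The following is a minimal sketch under assumptions, not the application's actual hook: `createAutoSubmit` and its parameters are hypothetical names, and the timer functions are injectable only so the logic can be exercised without waiting in real time.

```javascript
// Hypothetical helper: submits the latest transcript after `delayMs`
// without new speech. Names are illustrative, not taken from the app.
function createAutoSubmit(onSubmit, delayMs = 1000,
                          setTimer = setTimeout, clearTimer = clearTimeout) {
  let transcript = "";
  let timer = null;

  return {
    // Call this from the speech recognition result handler.
    onSpeech(text) {
      transcript = text;
      // New speech arrived, so restart the silence countdown.
      if (timer !== null) clearTimer(timer);
      timer = setTimer(() => {
        timer = null;
        const finalText = transcript.trim();
        transcript = "";
        if (finalText) onSubmit(finalText);
      }, delayMs);
    },
  };
}
```

In a LiveView hook, `onSubmit` would typically push the text to the server, for example with `this.pushEvent(...)`; the event name would depend on the app.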

🎨 User Interface

Clean, modern Teams-style chat interface
Dark mode support
Real-time message streaming
Voice settings with American/British English options
Conversation history with auto-scrolling

🔐 Authentication

Simple human verification (3-second timer + checkbox)
Session management
Auto-redirect to chat when authenticated
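One way to read the "3-second timer + checkbox" rule is that a visitor passes only if the box is ticked and at least three seconds have elapsed since the page loaded. The sketch below assumes that interpretation; `isHumanVerified` and its field names are hypothetical and do not come from the codebase.

```javascript
// Assumed rule: checkbox ticked AND at least 3 seconds on the page.
const MIN_WAIT_MS = 3000;

// Timestamps are milliseconds, e.g. from Date.now().
function isHumanVerified({ pageLoadedAt, submittedAt, checkboxChecked }) {
  const waitedLongEnough = submittedAt - pageLoadedAt >= MIN_WAIT_MS;
  return checkboxChecked && waitedLongEnough;
}
```

If a check like this is enforced on the server, the timestamps should be recorded there rather than trusted from the browser.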

Technology Stack

Backend: Elixir + Phoenix Framework + LiveView
Frontend: JavaScript + Three.js + TalkingHead
AI: Ollama (local LLM support)
TTS: Kokoro TTS (ONNX) with PythonX integration
Voice: Browser Web Speech API
Database: PostgreSQL
Deployment: nginx reverse proxy with SSL

Built with Claude Code as the vibe coding partner

Quick Start

Prerequisites

Elixir 1.14+
PostgreSQL
Node.js 18+
Python 3.x (for Kokoro TTS)

Installation

Install dependencies:

mix setup

Install Python TTS dependencies:

pip install kokoro-onnx soundfile

Start Phoenix server:

mix phx.server

Visit http://localhost:4000

Configuration

Ollama Setup

Install and run Ollama locally:

# macOS
brew install ollama
ollama serve

# Pull a model (e.g., llama2)
ollama pull llama2
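Once a model is pulled, Ollama's chat endpoint (`/api/chat`, port 11434 by default) streams its reply as newline-delimited JSON objects, each carrying a fragment in `message.content` and a `done` flag. The application talks to Ollama from Elixir; the JavaScript sketch below only illustrates how such a stream can be reassembled when chunks split a line at arbitrary points. `createChatStreamParser` is a hypothetical name.

```javascript
// Incrementally parses newline-delimited JSON in the shape emitted by
// Ollama's streaming chat responses. Illustrative only.
function createChatStreamParser(onToken) {
  let buffer = "";
  let done = false;

  return {
    push(chunk) {
      buffer += chunk;
      const lines = buffer.split("\n");
      // The last element is an incomplete line (or ""); keep it for later.
      buffer = lines.pop();
      for (const line of lines) {
        if (!line.trim()) continue;
        const event = JSON.parse(line);
        if (event.message && event.message.content) {
          onToken(event.message.content);
        }
        if (event.done) done = true;
      }
    },
    isDone() {
      return done;
    },
  };
}
```

Each token passed to `onToken` can be appended to the message being displayed, which is what makes the response appear to stream.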

Kokoro TTS

The Kokoro TTS model is pre-loaded at startup in a GenServer for optimal performance. Models are located in priv/models/.

Available Voices

American English: af_bella, af_nova, af_sarah, am_adam, am_fenrir, etc.
British English: bf_alice, bf_emma, bm_george, bm_lewis, etc.
See claude.md for the full voice list
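The voice identifiers above follow a two-letter prefix pattern. The sketch below assumes the first letter encodes the accent (`a` American, `b` British, matching the grouping above) and the second the voice gender (`f` or `m`); the gender reading and the `parseVoiceId` helper are assumptions for illustration, not part of the project.

```javascript
// Assumed prefix meanings; only the codes used in this README are mapped.
const ACCENTS = { a: "American English", b: "British English" };
const GENDERS = { f: "female", m: "male" };

// Splits an id such as "af_bella" into accent, gender, and name.
// Returns null when the id does not match the assumed pattern.
function parseVoiceId(voiceId) {
  const match = /^([a-z])([a-z])_(\w+)$/.exec(voiceId);
  if (!match) return null;
  const [, accentCode, genderCode, name] = match;
  const accent = ACCENTS[accentCode];
  const gender = GENDERS[genderCode];
  if (!accent || !gender) return null;
  return { accent, gender, name };
}
```

A helper like this could drive the American/British grouping in the voice settings menu instead of maintaining two hand-written lists.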

Production Deployment

SSL Certificates

SSL certificates are stored in priv/certs/. The nginx reverse proxy configuration references these certificates.

Important: nginx must be run with sudo to bind to privileged ports (80 and 443):

sudo nginx

To stop nginx:

sudo nginx -s stop

To reload nginx configuration:

sudo nginx -s reload

DNS Configuration

Ensure both A (IPv4) and AAAA (IPv6) records are configured for your domain.

Architecture Highlights

Persistent TTS Server

Chat.TTSServer GenServer keeps the Kokoro model loaded in memory
Avoids reloading the model on every request, which reduces latency compared to per-request initialization
Handles concurrent synthesis requests efficiently
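The benefit of a persistent server comes from paying the model-loading cost once rather than on every request. The actual implementation is an Elixir GenServer; the sketch below only contrasts the two strategies in JavaScript, with `PersistentTts`, `synthesizePerRequest`, and the `loadModel` callback all being hypothetical stand-ins.

```javascript
// Load-once strategy: the expensive load happens in the constructor,
// and every later request reuses the same model object.
class PersistentTts {
  constructor(loadModel) {
    this.model = loadModel();
  }

  synthesize(text, voice) {
    return this.model.synthesize(text, voice);
  }
}

// Per-request strategy, for comparison: the model is reloaded each time.
function synthesizePerRequest(loadModel, text, voice) {
  return loadModel().synthesize(text, voice);
}
```

With a stub `loadModel` that counts its calls, two requests through `PersistentTts` trigger one load, while two calls to `synthesizePerRequest` trigger two.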

LiveView Real-time Updates

Server-pushed LiveView updates for streaming AI responses
Client-side hooks for avatar control and voice input
Optimized DOM updates with phx-update="ignore" for Three.js canvas

Voice Pipeline

Speech Recognition → LiveView → Ollama LLM → Kokoro TTS → Avatar Playback

Project Structure

lib/
  chat/
    tts.ex              # TTS interface
    tts_server.ex       # Persistent Kokoro GenServer
    conversations.ex    # Chat logic & system prompts
  chat_web/
    live/chat_live/     # Main chat interface
    controllers/
      tts_controller.ex # TTS API endpoint
assets/
  js/
    app.js             # Main JS with hooks
    avatar3.js         # 3D avatar integration
priv/
  models/            # Kokoro TTS models
  static/avatars/    # Avatar GLB files

Learn More

Phoenix Framework: https://www.phoenixframework.org/
TalkingHead: https://github.com/met4citizen/TalkingHead
Kokoro TTS: https://huggingface.co/hexgrad/Kokoro-82M
Ollama: https://ollama.ai/
