System Architecture
High-level overview of Faxi's components and data flow
AI Models & Techniques
The AI models and techniques behind fax interpretation
Multi-Model AI Pipeline
Faxi's AI pipeline combines several specialized models. Each model handles one specific task, and their outputs are combined to improve the overall accuracy of the interpretation.
Vision AI (GPT-4 Vision)
Optical Character Recognition and Visual Analysis
Extracts both printed and handwritten text from fax images. It also analyzes document structure, identifies form fields, and recognizes Japanese characters.
Annotation Detector
Visual Annotation Recognition
Identifies hand-drawn marks on faxes, such as checkmarks, circles, arrows, and underlines. Each annotation is associated with nearby text so that the mark can be read as an expression of user intent.
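One way to picture the "nearby text" association is a nearest-bounding-box search. The sketch below is illustrative only: the `Box`, `TextRegion`, and `Annotation` shapes, the `nearestText` function, and the 80-pixel cutoff are all assumptions made for this example, not Faxi's actual data model or algorithm.

```typescript
// Hypothetical shapes; Faxi's real data model is not shown in this document.
interface Box { x: number; y: number; width: number; height: number; }
interface TextRegion { text: string; box: Box; }
interface Annotation {
  kind: "checkmark" | "circle" | "arrow" | "underline";
  box: Box;
}

// Centre point of a bounding box.
function centre(b: Box): { x: number; y: number } {
  return { x: b.x + b.width / 2, y: b.y + b.height / 2 };
}

// Return the text region whose centre is closest to the annotation's centre,
// or null when no region lies within maxDistance pixels (assumed cutoff).
function nearestText(
  annotation: Annotation,
  regions: TextRegion[],
  maxDistance: number = 80,
): TextRegion | null {
  const c = centre(annotation.box);
  let best: TextRegion | null = null;
  let bestDistance = maxDistance;
  for (const region of regions) {
    const rc = centre(region.box);
    const distance = Math.hypot(rc.x - c.x, rc.y - c.y);
    if (distance <= bestDistance) {
      best = region;
      bestDistance = distance;
    }
  }
  return best;
}
```

A production detector would likely also account for annotation type (an arrow points at something, an underline sits below it), which plain centre-to-centre distance ignores.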
Intent Classifier (Claude)
Natural Language Understanding and Action Extraction
Analyzes the extracted text and annotations to determine which action the user wants to perform. Classifies the intent (email, shopping, appointment, etc.) and extracts the parameters that the action requires.
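The classifier's output can be thought of as a small structured record: an intent label, its parameters, and a confidence value. The following is a minimal sketch of validating such a record, assuming the model is asked to reply in JSON. The `IntentResult` shape, the `"unknown"` fallback label, and `parseIntentReply` are hypothetical names introduced for illustration; the intent labels come from the examples in the text above.

```typescript
// Intent labels from the examples in the text, plus an assumed "unknown" fallback.
const INTENTS = ["email", "shopping", "appointment", "unknown"] as const;
type Intent = (typeof INTENTS)[number];

// Hypothetical result shape; not Faxi's actual schema.
interface IntentResult {
  intent: Intent;
  parameters: Record<string, string>;
  confidence: number; // clamped to the range 0..1
}

// Parse a JSON reply from the model, falling back to "unknown"
// when the reply is not valid JSON or names an unrecognized intent.
function parseIntentReply(reply: string): IntentResult {
  const fallback: IntentResult = { intent: "unknown", parameters: {}, confidence: 0 };
  let data: unknown;
  try {
    data = JSON.parse(reply);
  } catch {
    return fallback;
  }
  if (typeof data !== "object" || data === null) return fallback;
  const obj = data as Record<string, unknown>;

  const rawIntent = obj["intent"];
  const intent = INTENTS.find((candidate) => candidate === rawIntent);
  if (intent === undefined) return fallback;

  const rawConfidence = obj["confidence"];
  const confidence =
    typeof rawConfidence === "number" ? Math.min(1, Math.max(0, rawConfidence)) : 0;

  // Keep only string-valued parameters; anything else is dropped.
  const parameters: Record<string, string> = {};
  const rawParameters = obj["parameters"];
  if (typeof rawParameters === "object" && rawParameters !== null) {
    for (const [key, value] of Object.entries(rawParameters)) {
      if (typeof value === "string") parameters[key] = value;
    }
  }
  return { intent, parameters, confidence };
}
```

Validating the reply before acting on it means a malformed or unexpected model response degrades to an "unknown" intent rather than triggering the wrong action.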
Processing Pipeline
Image Preprocessing
Enhance image quality, remove noise, correct skew and rotation
Vision Analysis
Extract text regions and identify visual elements
Annotation Detection
Find and classify hand-drawn marks
Intent Extraction
Understand user intent and extract parameters
Confidence Scoring
Assess reliability of each component
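The five steps above can be sketched as a chain of stages followed by a scoring pass. This is an illustration under stated assumptions, not Faxi's implementation: the `FaxJob`, `Stage`, and `runPipeline` names are hypothetical, the stages are synchronous stubs (real stages would call model APIs asynchronously), and taking the minimum of the per-stage confidences is just one possible scoring rule.

```typescript
// Hypothetical job state passed from stage to stage.
interface FaxJob {
  image: string;         // placeholder for the fax image data
  text: string;          // filled in by vision analysis
  annotations: string[]; // filled in by annotation detection
  intent: string;        // filled in by intent extraction
}

// Each stage returns the updated job and a self-reported confidence (0..1).
interface StageOutput { job: FaxJob; confidence: number; }
interface Stage { name: string; run: (job: FaxJob) => StageOutput; }

interface PipelineResult {
  job: FaxJob;
  confidences: Record<string, number>;
  overall: number;
}

// Run the stages in order, then score: the overall confidence is the
// weakest stage's confidence (an assumed rule, chosen for simplicity).
function runPipeline(stages: Stage[], initial: FaxJob): PipelineResult {
  let job = initial;
  const confidences: Record<string, number> = {};
  for (const stage of stages) {
    const output = stage.run(job);
    job = output.job;
    confidences[stage.name] = output.confidence;
  }
  const values = Object.values(confidences);
  const overall = values.length > 0 ? Math.min(...values) : 0;
  return { job, confidences, overall };
}

// Stub stages with made-up outputs, standing in for steps 1 to 4.
const stages: Stage[] = [
  { name: "preprocess", run: (j) => ({ job: j, confidence: 1.0 }) },
  { name: "vision", run: (j) => ({ job: { ...j, text: "Reply to Tanaka" }, confidence: 0.9 }) },
  { name: "annotations", run: (j) => ({ job: { ...j, annotations: ["circle"] }, confidence: 0.8 }) },
  { name: "intent", run: (j) => ({ job: { ...j, intent: "email" }, confidence: 0.85 }) },
];

const result = runPipeline(stages, { image: "fax.tiff", text: "", annotations: [], intent: "" });
```

Keeping a confidence value per stage, rather than only the overall number, makes it possible to tell which component was unsure when a fax needs human review.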
Overall Performance
Technology Stack
Modern, scalable technologies powering the Faxi platform
Frontend
Astro
Fast static site generator for optimal performance
TypeScript
Type-safe JavaScript for robust code and better developer experience
Tailwind CSS
Utility-first CSS framework for rapid UI development
Backend
Express.js
Fast, minimalist web framework for Node.js
PostgreSQL
Robust relational database for storing users, jobs, and metrics
Redis
In-memory data store for job queues and caching
AI & Machine Learning
Claude (Anthropic)
Language model for intent extraction and natural language understanding (NLU)
GPT-4 Vision
Multimodal AI for OCR, handwriting recognition, and visual analysis
Infrastructure
Telnyx
Cloud communications platform for sending and receiving faxes
Docker
Containerization for consistent development and production environments
MCP Integration
Model Context Protocol servers extend Faxi's capabilities
What is MCP?
Model Context Protocol (MCP) is an open standard that lets AI systems connect securely to external data sources and tools. Faxi uses MCP servers to add functionality beyond basic fax processing.
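MCP messages are JSON-RPC 2.0, and a client invokes a server-side tool with the `tools/call` method. The sketch below only builds such a request object so its shape is visible; it does not open a connection. The `buildToolCall` helper, the `send_email` tool name, and its arguments are hypothetical and are not taken from Faxi's actual MCP servers.

```typescript
// Shape of an MCP tool invocation: a JSON-RPC 2.0 request
// whose method is "tools/call".
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: string; arguments: Record<string, unknown> };
}

// Simple incrementing request id so responses can be matched to requests.
let nextId = 1;

// Hypothetical helper that builds a tool-call request for a named tool.
function buildToolCall(toolName: string, args: Record<string, unknown>): ToolCallRequest {
  return {
    jsonrpc: "2.0",
    id: nextId++,
    method: "tools/call",
    params: { name: toolName, arguments: args },
  };
}

// "send_email" and its arguments are invented for this example.
const toolCall = buildToolCall("send_email", {
  to: "user@example.com",
  subject: "Fax reply",
});
```

In practice a client library would handle transport, initialization, and tool discovery; this example covers only the request payload.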