Uspeaks

Product Requirements Document (PRD): Applesauce Unified Licensing API

Project Overview

Project Name: Applesauce

Vision: A unified architectural framework for audio licensing that bifurcates into two distinct products (SKUs) to address the diverging needs of human-to-human media licensing and human-to-machine AI training.

Status: Alpha / Proof of Concept (POC)

Problem Statement

Current audio licensing is fragmented and fails to address two distinct market needs:

Granular Media Licensing: Creators need to sell specific usage rights (TV, Social, Ads) with price discrimination.
Mass AI Training: Developers need access to massive aggregate datasets without negotiating individual licenses, while creators deserve a fair share of the value.
The Two-SKU Architecture

The system is divided into two distinct licensing engines:

3.1 SKU 1: Dataset Licensing (The "Weighted" Model) Target: Media Producers, Game Developers, Ad Agencies

A user-specific, high-value license for a specific asset. Prices are determined by the "Five Knobs + Quality" framework.

The Four-Domain Engine: Categorizes every license request into one of four distinct domains:

A: Promotion: Use of the voice to sell, promote, or advertise. (e.g., Ads, Influencer content).
B: Entertainment: Use of the voice in narrative or artistic content. (e.g., Games, Film, TV).
C: Information & Utility: Functional or educational use. (e.g., GPS, Screen Readers).
D: Synthetic & Transformative: Use as data to train models or create deep fakes/clones.

The Pricing Calculator (Base * Domain * Modifiers): Calculates price based on 6 key variables (Knobs) defined in the codebase:

Quality: Q0 (Raw) to Q3 (Premium/Mastered).
Duration: Temporal validity (30 Days to Perpetual).
Reach: Geographic/Audience definition (Internal to Unlimited Digital).
Volume: Intensity of use (Single Project to Unlimited Reuse).
Exclusivity: Competitive restrictions (Non-Exclusive to Fully Exclusive).
Identity: Trust/Verification level (Tier 1 OAuth to Tier 3 KYC).

3.2 SKU 2: ML Training Licensing (The "Egalitarian" Model) Target: Foundation Model Builders, AI Companies

A broad, aggregate license for using part of the platform's library for machine learning training.

Model: Flat/Egalitarian revenue share.
Split: 70% to Human Creators / 30% to Platform.
Logic: Revenue is distributed pro-rata based on the volume of data contributed to the training set, not the specific commercial value of an individual voice.

Functional Requirements

4.1 Revenue & Transparency

Dataset SKU: Transactional revenue. Creator sets base price (or algorithms suggest it); Platform takes a fee.
ML Training SKU: Aggregate revenue. 70/30 split.
PDR (Personal Data Receipts): Machine-readable logs for creators detailing exactly which SKU was purchased and usage terms.

Technical Requirements

5.1 Architecture

Umbrella API: Unified FastAPI entry point (main_api.py) routing to sub-services.
Service A (datasetsLicensing): Handles SKU 1 logic, pricing math, and config.
Service B (MLTrainingLicensing): Handles SKU 2 logic, cohort analysis, and viability checks.

5.2 Data Architecture

Metadata: Implementation of MLCommons Croissant for embedding legal/provenance data.
Storage: Distinction between specific asset delivery (Dataset SKU) and bulk corpus access (Training SKU).

Legal & Compliance Framework

6.1 Minimum Contract Modules

Grant of Rights: Specific to the SKU (Individual License vs Data Consent).
ELVIS Act: Consent clauses for Voice Identity use (Critical for Synthetic Domain).
Liability: Caps limited to 12-month fees.

Roadmap & Success Metrics

Alpha Goal: Deploy both SKUs via the Unified API.
Metric: 100% PDR delivery rate.
Latency: Pricing calculation < 200ms.

Platform

Resources