Text Generation

Image Generation

Generated image

Structured JSON


  

Vision Analysis

Audio: Transcription


  

Audio: Text-to-Speech

Bonus: Translate + Speak

Translated text: