Scaling test-time compute
Boost LLM answers with flexible test-time search strategies
Multimodal Image-to-Video
Remove backgrounds from images instantly
Generate 3D models and videos from images
Generate a 3D mesh from a single image
Generate images by blending foregrounds with custom backgrounds
Erase any object from an image with just a prompt
Generate spatial audio from images (and optionally text)
Text-to-3D and image-to-3D generation
Media understanding
Edit image regions using a reference picture
Transcribe audio to text instantly using WebGPU
Generate animated video from two images and a prompt