nvidia/canary-1b-flash
Automatic Speech Recognition • Updated • 155k • 269
Reconstruct 3D Gaussians from unposes images.
In-browser speech recognition w/ word-level timestamps
Separate speakers in audio recordings
Highlight sound sources in images using audio