Google’s Gemini app now accepts audio file uploads, answering what the company acknowledges was its most requested feature.
For entrepreneurs and content teams, it means you can push recordings straight into Gemini for analysis, summaries, and repurposed content without jumping between tools.
Josh Woodward, VP at Google Labs and Gemini, announced the change on X:
“You can now upload any file to @GeminiApp. Including the #1 request: audio files are now supported!”
What’s New
Gemini can now ingest audio files in the same multi-file workflow you already use for documents and images.
You can attach up to 10 files per prompt, and files inside ZIP archives are supported, which helps when you want to upload raw tracks or multiple interview takes together.
Limits
- Free plan: total audio length up to 10 minutes per prompt; up to 5 prompts per day.
- AI Pro and AI Ultra: total audio length up to 3 hours per prompt.
- Per prompt: up to 10 files across supported formats. Details are listed in Google’s Help Center.
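If you batch recordings regularly, a quick pre-flight check can tell you whether a set of files fits before you upload. The following is a minimal sketch, not an official tool: the limits are hard-coded from the figures above and may change, and the names `AudioFile` and `check_batch` are hypothetical. It assumes you already know each file's duration, and it does not model how files inside a ZIP archive count toward the 10-file cap.

```python
from dataclasses import dataclass

# Limits as reported in this article; Google may change them at any time.
MAX_FILES_PER_PROMPT = 10
MAX_AUDIO_MINUTES = {"free": 10, "pro": 180, "ultra": 180}


@dataclass
class AudioFile:
    name: str
    minutes: float  # duration you measured yourself


def check_batch(files, plan):
    """Return a list of problems; an empty list means the batch fits."""
    if plan not in MAX_AUDIO_MINUTES:
        raise ValueError(f"unknown plan: {plan}")

    problems = []
    if len(files) > MAX_FILES_PER_PROMPT:
        problems.append(
            f"{len(files)} files exceeds the {MAX_FILES_PER_PROMPT}-file cap"
        )

    # The limit applies to the combined length of all audio in one prompt.
    total = sum(f.minutes for f in files)
    limit = MAX_AUDIO_MINUTES[plan]
    if total > limit:
        problems.append(
            f"{total:g} minutes of audio exceeds the {limit}-minute limit"
        )
    return problems


if __name__ == "__main__":
    takes = [AudioFile("take1.mp3", 4), AudioFile("take2.mp3", 5)]
    print(check_batch(takes, "free"))
```

An empty list means the batch is within the stated limits for that plan; otherwise, each string describes one limit the batch exceeds.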
Why This Matters
If your team works with podcasts, webinars, interviews, or customer calls, this closes a gap that often forced a separate transcription step.
You can upload a full interview and turn it into show notes, pull quotes, or a working draft in one place. It also helps meeting-heavy teams: a recorded strategy session can become action items and a brief without exporting to another tool first.
For agencies and networks, batching multiple episodes or takes into one prompt reduces friction in weekly workflows.
The practical win is fewer handoffs: source audio goes in, and the outlines, summaries, and excerpts you need come out, all within the same system you already use for text prompting.
Quick Tip
Upload your audio together with any supporting context in the same prompt. That gives Gemini the grounding it needs to produce cleaner summaries and more accurate excerpts.
If you’re testing on the free tier, plan around the 10-minute ceiling; longer content is best on AI Pro or Ultra.
Looking Ahead
Google’s limits pages do change, so keep an eye on total length, file-count rules, and any new guardrails that affect longer recordings or larger teams. Also watch for deeper Workspace tie-ins (for example, easier handoffs from Meet recordings) that could streamline getting audio into Gemini without manual uploads.
Featured Image: Photo Agency/Shutterstock