Posted by Paris Hsu – Product Manager, Android Studio
At every stage of the development lifecycle, Gemini in Android Studio has become your AI-powered companion, making it easier to build high-quality apps. We’re excited to announce a significant expansion: Gemini in Android Studio now supports multimodal inputs, which lets you attach images directly to your prompts! This unlocks a wealth of new possibilities that improve team collaboration and UI development workflows.
You can try out this new feature by downloading the latest Android Studio canary. We’ve outlined a few use cases to try, but we’d love to hear what you think as we work through bringing this feature into future stable releases. Check it out:
Image attachment – a new dimension of interaction
We first previewed Gemini’s multimodal capabilities at Google I/O 2024. This technology allows Gemini in Android Studio to understand simple wireframes and transform them into working Jetpack Compose code.
You'll now find an image attachment icon in the Gemini chat window. Simply attach JPEG or PNG files to your prompts and watch Gemini understand and respond to visual information. We've observed that images with strong color contrasts yield the best results.


We encourage you to experiment with various prompts and images. Here are a few compelling use cases to get you started:
- Rapid UI prototyping and iteration: Convert a simple wireframe or high-fidelity mock of your app’s UI into working code.
- Diagram explanation and documentation: Gain deeper insights into complex architecture or data flow diagrams by having Gemini explain their components and relationships.
- UI troubleshooting: Capture screenshots of UI bugs and ask Gemini for solutions.
Rapid UI prototyping and iteration
Gemini’s multimodal support lets you convert visual designs into functional UI code. Simply upload your image and use a clear prompt. It works whether you're working from your own sketches or from a designer mockup.
Here’s an example prompt: “For this image provided, write Android Jetpack Compose code to make a screen that's as close to this image as possible. Make sure to include imports, use Material3, and document the code.” You can then append any specific or additional instructions related to the image.
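To give a sense of the kind of output that prompt asks for, here is a minimal hand-written sketch of a documented Material3 screen with its imports. It is not actual Gemini output: the `WelcomeScreen` name, the text, and the layout are hypothetical, and it assumes a project that already depends on Jetpack Compose and Compose Material3.

```kotlin
// Hypothetical example of a "first draft" screen; names and layout are illustrative only.
import androidx.compose.foundation.layout.Arrangement
import androidx.compose.foundation.layout.Column
import androidx.compose.foundation.layout.fillMaxSize
import androidx.compose.foundation.layout.fillMaxWidth
import androidx.compose.foundation.layout.padding
import androidx.compose.material3.Button
import androidx.compose.material3.MaterialTheme
import androidx.compose.material3.Surface
import androidx.compose.material3.Text
import androidx.compose.runtime.Composable
import androidx.compose.ui.Alignment
import androidx.compose.ui.Modifier
import androidx.compose.ui.tooling.preview.Preview
import androidx.compose.ui.unit.dp

/**
 * A simple welcome screen with a title, a subtitle, and one primary action.
 *
 * @param onGetStarted called when the user taps the primary button.
 * @param modifier optional modifier applied to the root surface.
 */
@Composable
fun WelcomeScreen(
    onGetStarted: () -> Unit,
    modifier: Modifier = Modifier,
) {
    Surface(
        modifier = modifier.fillMaxSize(),
        color = MaterialTheme.colorScheme.background,
    ) {
        Column(
            modifier = Modifier
                .fillMaxSize()
                .padding(24.dp),
            verticalArrangement = Arrangement.Center,
            horizontalAlignment = Alignment.CenterHorizontally,
        ) {
            Text(text = "Welcome", style = MaterialTheme.typography.headlineMedium)
            Text(
                text = "Sign in to continue",
                style = MaterialTheme.typography.bodyLarge,
                modifier = Modifier.padding(top = 8.dp, bottom = 24.dp),
            )
            // Primary action; the click is hoisted so the caller decides what happens.
            Button(onClick = onGetStarted, modifier = Modifier.fillMaxWidth()) {
                Text(text = "Get started")
            }
        }
    }
}

/** Preview so the layout can be compared side by side with the source image. */
@Preview(showBackground = true)
@Composable
private fun WelcomeScreenPreview() {
    MaterialTheme {
        WelcomeScreen(onGetStarted = {})
    }
}
```

Keeping a `@Preview` next to the screen makes it easier to check the result against the attached image and then follow up with more specific instructions.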


For more complex UIs, refine your prompts to capture specific functionality. For instance, when converting a calculator mockup, adding “make the interactions and calculations work as you’d expect” results in a fully functional calculator.
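As a rough illustration of what "working interactions and calculations" involves beyond layout, the sketch below shows one possible state holder for a four-function calculator in plain Kotlin. It is an assumption about how such logic could be structured, not code produced by Gemini; `CalculatorState` and its methods are hypothetical names.

```kotlin
import kotlin.math.abs

/** Hypothetical state holder for a four-function calculator; illustrative only. */
class CalculatorState {
    /** Text currently shown on the calculator display. */
    var display: String = "0"
        private set

    private var accumulator: Double? = null   // left-hand operand, if any
    private var pendingOperator: Char? = null // operator waiting for a right-hand operand
    private var startNewNumber = true         // next digit replaces the display

    fun onDigit(digit: Int) {
        require(digit in 0..9) { "digit must be in 0..9" }
        display = if (startNewNumber || display == "0") digit.toString() else display + digit
        startNewNumber = false
    }

    fun onOperator(operator: Char) {
        require(operator in "+-*/") { "unsupported operator: $operator" }
        // Evaluate only if a new operand was typed; otherwise just swap the operator.
        if (!startNewNumber || accumulator == null) applyPending()
        pendingOperator = operator
        startNewNumber = true
    }

    fun onEquals() {
        applyPending()
        pendingOperator = null
        startNewNumber = true
    }

    fun onClear() {
        display = "0"
        accumulator = null
        pendingOperator = null
        startNewNumber = true
    }

    private fun applyPending() {
        val current = display.toDoubleOrNull() ?: return
        val left = accumulator
        val op = pendingOperator
        if (op == '/' && left != null && current == 0.0) {
            // Division by zero: show an error and reset the calculation.
            display = "Error"
            accumulator = null
            pendingOperator = null
            return
        }
        val result = if (left == null || op == null) current else when (op) {
            '+' -> left + current
            '-' -> left - current
            '*' -> left * current
            else -> left / current
        }
        accumulator = result
        display = format(result)
    }

    /** Shows whole numbers without a trailing ".0". */
    private fun format(value: Double): String =
        if (value % 1.0 == 0.0 && abs(value) < 1e15) value.toLong().toString() else value.toString()
}

fun main() {
    val calculator = CalculatorState()
    calculator.onDigit(1)
    calculator.onDigit(2)
    calculator.onOperator('+')
    calculator.onDigit(7)
    calculator.onEquals()
    println(calculator.display) // prints 19
}
```

In a Compose UI, each button's `onClick` would call one of these methods and the display `Text` would read `display` from observable state.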


Note: this feature provides an initial design scaffold. It’s a good “first draft” and your edits and adjustments will be needed. Common refinements include ensuring correct drawable imports and importing icons. Consider the generated code a highly efficient starting point, accelerating your UI development workflow.
Diagram explanation and documentation
With Gemini’s multimodal capabilities, you can also try uploading an image of your diagram and asking for explanations or documentation.
Example prompt: Upload the Now in Android architecture diagram and say “Explain the components and data flow in this diagram” or “Write documentation about this diagram”.

UI troubleshooting
Leverage Gemini’s visual analysis to identify and resolve bugs quickly. Upload a screenshot of the problematic UI, and Gemini will analyze the image and suggest potential solutions. You can also include relevant code snippets for more precise assistance.
In one example, we used Compose UI Check and found that the button is stretched too wide on tablet screens, so we took a screenshot and asked Gemini for solutions. It was able to leverage the window size classes to provide the right fix.
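The idea behind a window-size-class fix can be sketched in plain Kotlin. This is a simplified illustration, not the fix Gemini suggested: `WidthClass`, `widthClassFor`, `buttonWidthDp`, the 16dp padding, and the 320dp cap are all hypothetical, while the 600dp and 840dp breakpoints follow the standard compact, medium, and expanded width classes.

```kotlin
/** Width buckets mirroring the compact / medium / expanded window width classes. */
enum class WidthClass { COMPACT, MEDIUM, EXPANDED }

/** Classifies a window width in dp using the 600dp and 840dp breakpoints. */
fun widthClassFor(windowWidthDp: Int): WidthClass = when {
    windowWidthDp < 600 -> WidthClass.COMPACT
    windowWidthDp < 840 -> WidthClass.MEDIUM
    else -> WidthClass.EXPANDED
}

/**
 * Returns the width a primary button should take.
 * On compact windows it fills the available width; on wider windows it is capped
 * so it no longer stretches across a tablet screen. The padding and cap values
 * are assumptions chosen for illustration.
 */
fun buttonWidthDp(
    windowWidthDp: Int,
    horizontalPaddingDp: Int = 16,
    maxWidthDp: Int = 320,
): Int {
    val available = (windowWidthDp - 2 * horizontalPaddingDp).coerceAtLeast(0)
    return when (widthClassFor(windowWidthDp)) {
        WidthClass.COMPACT -> available
        WidthClass.MEDIUM, WidthClass.EXPANDED -> minOf(available, maxWidthDp)
    }
}

fun main() {
    println(buttonWidthDp(360))  // phone: prints 328
    println(buttonWidthDp(1280)) // tablet: prints 320
}
```

In an actual Compose layout, the same decision would typically be expressed by reading the current window size class and applying a modifier such as `Modifier.widthIn(max = ...)` on wider windows instead of `fillMaxWidth()` alone.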

Download Android Studio today
Download the latest Android Studio canary today to try the new multimodal features!
As always, Google is committed to the responsible use of AI. Android Studio won't send any of your source code to servers without your consent. You can read more on Gemini in Android Studio’s commitment to privacy.
We appreciate any feedback on things you like or features you would like to see. If you find a bug, please report the issue and also check out known issues. Remember to also follow us on X, Medium, or YouTube for more Android development updates!