
Practical Android AI

First Edition · Android 13 · Kotlin 2.0 · Android Studio Otter

3. Getting Started with Android Generative AI
Written by Zahidur Rahman Faisal


Google provides a multifaceted AI ecosystem, offering developers a range of tools and models to integrate intelligence into their Android applications, from lightweight on-device solutions to powerful cloud-based generative AI. However, finding the right AI/ML solution can be tricky! This chapter guides you through selecting the most suitable one for your app.

To simplify your decision, ask yourself this:

What is the primary goal of the AI feature?

  • Use Generative AI if you’re generating fairly simple new content (e.g., text or images) or performing simple text processing, such as summarizing, proofreading, or rewriting text.

  • Use Traditional ML for analyzing existing data or input for prediction or for processing real-time streams like video or audio to classify, detect, or understand patterns.

Gemini Models: The Foundation of Intelligent Android Experiences

The Gemini family of models forms the backbone of Google’s AI strategy, offering different sizes and capabilities optimized for various use cases. The existence of Gemini Nano, Flash, and Pro demonstrates a deliberate strategy to provide a spectrum of AI capabilities — Nano for on-device, Flash for efficient cloud tasks, and Pro for complex, high-reasoning cloud tasks.

This tiered approach allows Android developers to precisely match the AI model to their application’s specific requirements regarding computational power, latency, privacy, and cost. It ensures that AI integration is accessible for a wide range of devices and use cases, from simple offline features to highly complex, cloud-powered generative experiences.
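As a rough illustration, the tier trade-offs above can be encoded as a small selection function. This is a sketch only: the criteria names (`FeatureNeeds`, `pickTier`, and its flags) are invented for this example and are not part of any SDK.

```kotlin
// Illustrative only: a rough encoding of the Nano/Flash/Pro trade-offs
// described above. All names here are this sketch's own, not an official API.
enum class GeminiTier { NANO, FLASH, PRO }

data class FeatureNeeds(
    val mustWorkOffline: Boolean,       // privacy or latency demands on-device
    val needsComplexReasoning: Boolean  // hard prompts, long context, advanced coding
)

fun pickTier(needs: FeatureNeeds): GeminiTier = when {
    needs.mustWorkOffline -> GeminiTier.NANO       // on-device, data stays local
    needs.needsComplexReasoning -> GeminiTier.PRO  // highest capability in the cloud
    else -> GeminiTier.FLASH                       // fast, low-cost cloud default
}
```

In a real app the inputs would come from your feature requirements, but the ordering of the checks captures the key point: offline/privacy constraints decide first, and Flash is the sensible default when neither constraint applies.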

Gemini Nano

Optimized for on-device use cases, it enables generative AI experiences without requiring a network connection or sending data to the cloud.

Gemini Flash

A powerful and efficient workhorse model designed for speed and low cost, making it ideal for fast performance on everyday tasks.  

Gemini Pro

Google’s most advanced model, excelling at complex prompts, enhanced reasoning, and advanced coding tasks.

[Decision flowchart: selecting an Android AI/ML solution based on whether the feature is generative, must work offline, integrates with Firebase, and needs image generation or advanced reasoning.]

Choosing Between On-device and Cloud-based Approaches

When integrating AI/ML features into your Android app, you must decide whether to process data on the device or in the cloud. Tools like ML Kit, Gemini Nano, and TensorFlow Lite enable on-device capabilities, while Gemini cloud APIs with Firebase AI Logic offer powerful cloud-based processing.

On-device Generative AI

Gemini Nano is Android’s on-device large language model; it runs locally, without a network connection. It’s built into Android’s AICore system service, which leverages device hardware for low-latency inference while keeping user data on the device.

Cloud Generative AI

Use Cloud Generative AI when you need capabilities beyond what on-device models can handle, such as long-document analysis, code generation at scale, or multimodal tasks involving large images or video. Gemini in the cloud can process text, image, audio, and video inputs, as long as you’re willing to send them over the network.
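A minimal sketch of how such a cloud call might be wired. The names here (`CloudModel`, `buildDocumentPrompt`, `askAboutDocument`) are invented for this example; in a real app the interface would be backed by the Firebase AI Logic SDK’s generative model (e.g., a Gemini Flash or Pro model), which requires an Android app with Firebase configured, so a stub stands in here.

```kotlin
// Sketch only: CloudModel abstracts "send a prompt, get text back" so the
// surrounding logic stays runnable without network access. In production,
// its single method would delegate to a Firebase AI Logic generative model.
fun interface CloudModel {
    fun generate(prompt: String): String
}

// Assemble a text prompt for a long-document question (illustrative format).
fun buildDocumentPrompt(document: String, question: String): String =
    "Answer the question using only this document.\n" +
        "Document:\n$document\n" +
        "Question: $question"

fun askAboutDocument(model: CloudModel, document: String, question: String): String =
    model.generate(buildDocumentPrompt(document, question))
```

Keeping the model behind a small interface like this also makes the feature easy to unit-test with a stub before wiring in the real cloud SDK.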

[Comparison of cloud model strengths: Gemini Flash for performance and cost-effectiveness, Gemini Pro for complex reasoning and higher-quality output, and Imagen 4 for image generation and understanding.]

Google Cloud Platform

Another cloud-based solution is Google Cloud Platform, which is suitable if you are willing to manage your own backend integration and need maximum flexibility and control outside of the Firebase ecosystem.

Conclusion

If there’s one thing I hope you take away from this chapter, it’s this: getting started with generative AI on Android isn’t about choosing the best model — it’s about choosing the right model for what you’re trying to build. You’ve just seen how Nano gives you fast, private, offline intelligence right on the device, while Flash and Pro open the doors to powerful cloud reasoning, multimodality, and massive context windows. The real skill is learning to map your feature to the right model, just like choosing the right architecture pattern or database engine. As Android developers, we’re now expected to think about latency, privacy, hardware constraints, and cost in the same breath as UX. That’s new—and exciting!

[Chapter summary diagram: mapping feature requirements to on-device solutions (ML Kit, Gemini Nano) and cloud solutions (Firebase AI Logic with Gemini Flash, Gemini Pro, and Imagen 4; Google Cloud).]

Appendix: Quiz on Android Generative AI Solutions

Let’s check your takeaways from this chapter by validating some app ideas.

The Options

| Option | Flowchart Path | Primary Purpose | Solution Choice |
|---|---|---|---|
| A | On-device → Streamlined Tasks | Simple, pre-built on-device generative tasks (Summarize, Rewrite, Image Descriptions) | ML Kit (Generative APIs) |
| B | On-device → Custom Access | Custom/open prompting on-device, beyond ML Kit’s streamlined tasks | Gemini Nano |
| C | Cloud → Firebase → Performance/Cost | Cloud generation prioritizing speed and cost-effectiveness for general tasks | Firebase AI Logic (Gemini Flash) |
| D | Cloud → Firebase → Higher Quality/Capability | Cloud generation for complex reasoning and higher-quality output | Firebase AI Logic (Gemini Pro) |
| E | Cloud → Firebase → Advanced Image Generation | Cloud generation specifically for creating or understanding images | Firebase AI Logic (Imagen 4) |
| F | Cloud → No Firebase Integration | Cloud generation for maximum flexibility and control outside of the Firebase ecosystem | Google Cloud |
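To make the options concrete, the flowchart paths can be encoded as a single selection function. This is an illustrative sketch: the flag and enum names are invented for this example, and the order of the checks mirrors the flowchart’s branching.

```kotlin
// Illustrative encoding of the A-F flowchart paths. Names are this sketch's
// own; earlier branches take priority, matching the flowchart's order.
enum class Solution { ML_KIT, GEMINI_NANO, GEMINI_FLASH, GEMINI_PRO, IMAGEN_4, GOOGLE_CLOUD }

fun chooseSolution(
    worksOffline: Boolean,      // must the feature run without a network?
    streamlinedTask: Boolean,   // summarize / rewrite / describe images
    useFirebase: Boolean,       // willing to integrate via Firebase?
    imageGeneration: Boolean,   // creating or understanding images
    complexReasoning: Boolean   // large context, nuanced analysis
): Solution = when {
    worksOffline && streamlinedTask -> Solution.ML_KIT   // A
    worksOffline -> Solution.GEMINI_NANO                 // B
    !useFirebase -> Solution.GOOGLE_CLOUD                // F
    imageGeneration -> Solution.IMAGEN_4                 // E
    complexReasoning -> Solution.GEMINI_PRO              // D
    else -> Solution.GEMINI_FLASH                        // C
}
```

You can trace the quiz scenarios below through this function to check your own answers.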

1. The Smart “Note-Taker” App

Scenario: You are building an intelligent note-taking application. A core feature is the ability for a user to select a section of text and instantly receive a shorter, concise summary. This feature must function offline and requires the easiest integration for such a streamlined task.

2. The “Artistic Profile” App

Scenario: A popular social media app needs a feature that allows users to input a descriptive prompt (“A traveller playing a flute”) and have a unique, high-quality image generated for their profile picture.

3. The “Long-form Editor” App

Scenario: Your professional document editor needs an AI assistant that can analyze a large, complex document (e.g., a 100-page PDF) and answer nuanced questions about its content. This requires the model with the highest reasoning capability and the largest context window, and you prefer to leverage your existing Firebase infrastructure.

Answer Key and Explanation

1. The Smart “Note-Taker” App

  • Flowchart Path: Generative AI → Function Offline (Yes) → Streamlined Tasks (Summarize, Rewrite, Image Descriptions)

  • Solution Choice: ML Kit (Generative APIs), Option A

  • Reasoning: ML Kit is the easiest integration point for the on-device Gemini Nano model when performing these common, pre-defined tasks.

2. The “Artistic Profile” App

  • Flowchart Path: Generative AI → Function Offline (No) → Ease of Integration with Firebase (Yes) → Advanced Image Generation or Understanding

  • Solution Choice: Firebase AI Logic (Imagen 4), Option E

  • Reasoning: The task is specifically image generation, making Imagen 4 via the Firebase AI Logic SDK the correct choice.

3. The “Long-form Editor” App

  • Flowchart Path: Generative AI → Function Offline (No) → Ease of Integration with Firebase (Yes) → Higher Quality and Capability

  • Solution Choice: Firebase AI Logic (Gemini Pro), Option D

  • Reasoning: Analyzing large, complex documents requires the highest reasoning and the largest context window, which are the primary strengths of Gemini Pro.

Have a technical question? Want to report a bug? You can ask questions and report bugs to the book authors in our official book forum here.
© 2026 Kodeco Inc.
