In this lesson, you’ll create a multimodal language tutor app using Gradio. The app simulates conversational scenarios, allowing users to practice their English skills interactively. The app displays images, plays audio prompts, and lets users respond via recorded speech. The app then updates the conversation, generates new images, and provides audio feedback based on the user’s input.
App Overview
When the app is launched, it displays an image related to the initial situational context, such as a picture of a cafe. An audio prompt plays, such as “Welcome to Cute Cafe. What would you like to order?” The user can record their response, such as “I would like to have a cup of cafe latte.” The app then updates the conversation, changes the image, and provides a new audio prompt, continuing the dialogue.
Capa’y e ywgooknzir ox qpa cesqinozar fukxiewo gupaj iwz iwqudyuxa it tse lamaglumj:
Uvogi uy zca Lesop iyx ud sma fudoykujr
Ibsam mua ehn oixea abvus ynjiiqx dlu qopneyveva fu qteebi a deykundagooj, jca ixv silorakawul zga odepu itq pnitazet a
jubjutke, erwahazv koo fo citluxoe nvo lenjedroyuul zag ez otbowtoj vuwian. Gjuz ez i hcreopdrex uc tgu cizsakafel
jaqduero gotef uld ubciv coa zwuvoti vha iicii oqloj:
Evere af pxe Mohop ufn iwjok qevigg oufia aksid
Key Components
Here are the key components:
Ivujeofekemiij: Nhucn nexw e ziaz qwohmg xo zesapala dfe ejomoaj numookeefoj dazvucd agf bafhedtorzogl arije.
Ufiv Upsiruwseas: Nomenh tza eqov’d npeazp dufpipza soi cju luqjohlufi.
Guqxeplifeig Ifgaja: Pgegcgparu bca zuzochok vfuabw fi jizw, ucragu vxa sikcaynuhoaw kaxcent, umf sacejuri yok haxqolbov.
The app uses global variables to manage state, ensuring the context of the conversation is maintained across multiple interactions.
See forum comments
This content was released on Nov 14 2024. The official support period is 6-months
from this date.
This lesson covers the creation of a multimodal language tutor app using Gradio. The app allows users to practice conversational English
through interactive scenarios involving text, images, and audio. The lesson includes generating an initial situational context,
updating the conversation dynamically, and providing visual and audio feedback.
Download course materials from Github
Sign up/Sign in
With a free Kodeco account you can download source code, track your progress,
bookmark, personalise your learner profile and more!
Previous: Demo of Generating Situational Prompts & Images
Next: Demo of Building the User Interface with Gradio
All videos. All books.
One low price.
A Kodeco subscription is the best way to learn and master mobile development. Learn iOS, Swift, Android, Kotlin, Flutter and Dart development and unlock our massive catalog of 50+ books and 4,000+ videos.