ElaRech is an intelligent, voice-interactive, multimodal Virtual Learning assistant that helps students explore and understand visual academic content such as diagrams, charts, handwritten notes, and academic papers. Just speak your query, upload an image, and ElaRech will answer both visually and audibly.
- Multimodal Model: meta-llama/llama-4-scout-17b-16e-instruct via Groq API
- Voice Recognition: Whisper
- TTS Engines: Google gTTS & ElevenLabs
- Groq API – ultra-fast LLM inference
- LLaMA-4 Vision model – meta-llama/llama-4-scout-17b-16e-instruct
- Whisper – speech recognition
- gTTS & ElevenLabs – voice output
- Gradio – web interface
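As an illustration of how a text query and an image can be combined for the multimodal model, here is a minimal sketch of building a message payload in the OpenAI-compatible chat format that Groq exposes. The helper name and exact field layout are assumptions for illustration, not ElaRech's actual code:

```python
import base64

# Hypothetical helper: packages a spoken query (already transcribed to text)
# together with an uploaded image as one multimodal chat message.
# The image is embedded as a base64 data URL, a common convention for
# OpenAI-compatible vision endpoints such as Groq's.
def build_multimodal_message(query: str, image_bytes: bytes) -> list:
    encoded = base64.b64encode(image_bytes).decode("utf-8")
    return [
        {
            "role": "user",
            "content": [
                {"type": "text", "text": query},
                {
                    "type": "image_url",
                    "image_url": {"url": f"data:image/jpeg;base64,{encoded}"},
                },
            ],
        }
    ]
```

The resulting list can then be passed as the `messages` argument of a chat-completion call against `meta-llama/llama-4-scout-17b-16e-instruct`.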
- 🎙️ Voice Input: Speak your learning question naturally.
- 🧠 Multimodal AI: Combines your voice query with an uploaded image to give smart, context-aware answers.
- 🖼️ Image Understanding: Upload diagrams, charts, handwritten pages, or screenshots — ElaRech understands them.
- 💬 LLM-Powered Responses: Powered by `meta-llama/llama-4-scout-17b-16e-instruct` via the Groq API.
- 🔊 Dual TTS Engines: Replies are spoken aloud using both gTTS and ElevenLabs.
- 🌐 Gradio Web Interface: Clean, easy-to-use interface accessible from your browser.
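The feature list above amounts to a three-stage pipeline: transcribe the voice query, answer it against the image with the LLM, then speak the answer. A minimal sketch of that flow, with the stages injected as plain functions so it runs without network access (in the real app these would be Whisper, the Groq client, and gTTS/ElevenLabs; all names here are illustrative):

```python
# Illustrative wiring of the voice -> image -> LLM -> speech pipeline.
# Each stage is passed in as a callable, so this sketch makes no
# assumptions about ElaRech's internal module APIs.
def run_pipeline(transcribe, ask_llm, synthesize, audio_path, image_path):
    query = transcribe(audio_path)        # speech -> text (e.g. Whisper)
    answer = ask_llm(query, image_path)   # text + image -> LLM answer
    speech_file = synthesize(answer)      # answer text -> spoken audio file
    return query, answer, speech_file
```

Keeping the stages as injected callables also makes each one easy to swap (e.g. gTTS vs. ElevenLabs) or stub out in tests.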
```
ElaRech/
├── gradio_app.py                   # Main app with Gradio interface
├── brain_of_the_elaRech.py         # Core logic for image + query processing
├── .env                            # API keys
├── voice_of_Virtual Learninger.py
└── voice_of_user.py
```
```
git clone https://github.com/iamafridi/elaRech.git
cd elaRech
```

Create a `.env` file in the project root with your API keys:

```
GROQ_API_KEY=your_groq_api_key
ELEVENLABS_API_KEY=your_elevenlabs_api_key
```

Then launch the app:

```
python gradio_app.py
```
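At startup the app needs both keys from the `.env` file. A small sketch of validating them, assuming the `.env` values have been exported into the environment (for example via python-dotenv's `load_dotenv()`); the helper name is illustrative:

```python
import os

# Illustrative startup check: fail fast with a clear message if a
# required API key is missing, instead of erroring later mid-request.
def require_key(name: str) -> str:
    value = os.getenv(name)
    if not value:
        raise RuntimeError(f"Missing required environment variable: {name}")
    return value
```

Called once for `GROQ_API_KEY` and `ELEVENLABS_API_KEY` before the Gradio interface launches, this turns a cryptic API error into an actionable setup message.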
Upload a diagram and ask:
🗣️ "Explain this process in simple terms."
📢 ElaRech will generate a voice and text response explaining the diagram based on your question.
📜 License
MIT License
Afridi Akbar Ifty
- GitHub: https://github.com/iamafridi
- Portfolio: https://iamafrididev.netlify.app
- LinkedIn: your-linkedin-profile
