Project Overview
A multimodal tutoring assistant for education workflows, combining text, image, and voice interaction in a single demonstration-oriented pipeline.

Background & Goal
The goal is a controllable, explainable tutoring assistant that maintains context continuity across an interaction and reports its evaluation signals clearly.
This sample intentionally leaves its links empty so that the Links block can be conditionally hidden.