Multimodal Systems
Applications that process or combine multiple data modalities — images, documents, audio, video — alongside or instead of structured tabular data.
How do we extract information and make decisions from images, scanned documents, or multimedia?
Notes
- Visual Inspection — computer vision for manufacturing defect detection
- Document Intelligence — multimodal document understanding (invoices, forms)