Multimodal Systems

Applications that process or combine multiple data modalities — images, documents, audio, video — alongside or instead of structured tabular data.

How do we extract information and make decisions from images, scanned documents, or multimedia?

Notes