Multimodal AI
AI that processes and understands multiple types of data like text, images, and drawings simultaneously.
Definition
Multimodal AI refers to artificial intelligence systems that can process, understand, and connect information across different data modalities—including text documents, images, CAD drawings, PDFs, and tables. In the AEC context, this means AI that can simultaneously analyze a drawing (visual) and its associated specification (text) to understand the complete design intent, enabling more sophisticated analysis and search capabilities than text-only or image-only AI systems.
Examples
Searching for details by uploading a photo or sketch
Understanding both the visual elements of a drawing and the text annotations
Connecting specification requirements to relevant drawing details
Related Use Cases
Automated Drawing Reviews
Firmwide Detail Search
Related Terms
Related Keywords
Ready to use Multimodal AI in your firm?
See how Nomic brings domain-specific AI to architecture, engineering, and construction firms.

