Multimodal Presentation Example Model

FUDOKI: A Unified Multimodal Model Purely Based on Discrete Flow Matching (NeurIPS 2025 spotlight)

The rapid progress of large language models (LLMs) has catalyzed the emergence of multimodal large language models (MLLMs) that unify visual understanding and image generation within a single ...

Microsoft

Magma: A Foundation Model for Multimodal AI Agents

We present Magma, a foundation model that serves multimodal AI agentic tasks in both the digital and physical worlds. Magma is a significant extension of vision-language (VL) models in that it not ...

Unite.AI

The Coming Wave of Multimodal Attacks: When AI Tools Become the New Exploit Surface

As large language models (LLMs) evolve into multimodal systems that can handle text, images, voice and code, they’re also becoming powerful orchestrators of external tools and connectors. With this ...

GitHub

Towards Efficient Multimodal Unified Reasoning Model via Model Merging

Although Multimodal Large Language Models (MLLMs) have demonstrated remarkable capabilities across diverse tasks, they encounter challenges in terms of reasoning efficiency, large model size and ...

IEEE

PV-PASBLS: A Multimodal Point-View Fusion Model Based on Parameter Adaptive Stacked Broad Learning System for 3-D Shape Recognition

Abstract: Most existing multimodal point-view fusion models for 3-D shape recognition typically improve recognition accuracy through complex feature fusion mechanisms. However, these mechanisms ...

The Robot Report

Ai2 says its Molmo 2 multimodal AI model can do more with less data

The Allen Institute for AI, also known as Ai2, last week released Molmo 2, its latest multimodel suite capable of precise spatial and temporal understanding of video, image, and multi-image sets.

1011 Now

Lincoln StarTran Multi-modal Transportation Center earns architecture award for concept, presentation

LINCOLN, Neb. (KOLN) - The StarTran Multi-modal Transportation Center received an Excellence in Unbuilt Architecture Honor Award for the project’s concept and presentation, Lincoln Transportation and ...

Frontiers

MoltiTox: a multimodal fusion model for molecular toxicity prediction

1 Department of Computer Science and Engineering, Sungkyunkwan University, Suwon, Republic of Korea 2 Department of Systems Management Engineering, Sungkyunkwan University, Suwon, Republic of Korea ...

Journal of Medical Internet Research

Multimodal Data–Driven Explainable Prognostic Model for Major Adverse Cardiovascular Events Prediction in Patients With Unstable Angina and Heart Failure With Preserved ...

The model with the best C-index performance was deployed as a web-based risk calculator. Additionally, we assessed other performance indicators of the best-performing model, including the area under ...

IEEE

A Multimodal Contrastive and Transfer Learning-Based Image Restoration Model for Multiple Adverse Weather Driving Scenes

Abstract: Adverse weather conditions like rain, fog, and snow significantly hinder perception in autonomous driving systems. This paper proposes a multimodal contrastive learning and transfer learning ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results