Draft:Spatial intelligence (artificial intelligence)


Spatial intelligence is a term used in artificial intelligence research to describe systems capable of perceiving, understanding, reasoning about, generating, and interacting with three-dimensional physical and virtual environments. It emphasizes "world models" that incorporate spatial relationships, geometry, physics, and dynamics, in contrast to text- or image-centric models.[1][2]

The concept has been prominently advocated by computer scientist Fei-Fei Li, who has described it as a necessary next step for artificial intelligence beyond large language models. Li co-founded World Labs in 2024 to develop related technologies.[3][4]

Definition

According to Stanford HAI, spatial intelligence in artificial intelligence refers to systems that can understand and reason about the three-dimensional physical world, including how objects relate to each other in space, how they move, and how they interact.[2]

Fei-Fei Li has characterized it as the ability of artificial intelligence to perceive, reason about, generate, and interact with 3D environments in a manner grounded in physical reality, contrasting it with the more abstract capabilities of large language models.[1][5]

History

The modern usage of the term in artificial intelligence gained attention in 2024 when Fei-Fei Li began publicly promoting spatial intelligence as a key research direction. In a May 2024 TED Talk and subsequent writings, she argued that artificial intelligence systems need this capability to achieve more human-like understanding of the physical world.[6]

Li co-founded World Labs in early 2024 to pursue this area. The company raised $230 million in its initial round and an additional $1 billion in February 2026.[3][7]

NVIDIA operates a Spatial Intelligence Lab (SIL) focused on related technologies for perception, modeling, and interaction with the physical world.[8]

Research and development

Research groups working in this area include:

  • Stanford University’s Institute for Human-Centered Artificial Intelligence (HAI) and Li’s Stanford Vision and Learning Lab.
  • NVIDIA’s Spatial Intelligence Lab.
  • World Labs, which released Marble, a multimodal world model for generating and editing 3D environments.[9]

The concept builds on earlier work in computer vision, robotics, and world modeling, but the specific framing as "spatial intelligence" is recent and closely associated with Li’s advocacy.

Technologies

Enabling technologies discussed in relation to spatial intelligence include computer vision and multimodal models for 3D perception, as well as generative 3D techniques such as 3D Gaussian Splatting used in models like World Labs’ Marble to produce spatially consistent, persistent, and navigable environments from text, image, video, or panorama inputs.[9][4]

NVIDIA’s Spatial Intelligence Lab advances foundational technologies for artificial intelligence systems to perceive, model, and interact with the physical world.[8]

Applications

Proponents suggest potential uses in robotics and embodied artificial intelligence (such as navigation, manipulation, and human-robot collaboration), creative tools for film, video games, and architecture (for example, rapid generation of explorable 3D worlds with Marble), scientific simulation, and industrial planning including facility modeling, safety scenario testing, and operational strategy rehearsal.[9][4] These applications remain largely prospective as of 2026.[5]

Industry examples

  • World Labs develops multimodal world models such as Marble for generating and editing editable 3D environments.[9]
  • NVIDIA researches and invests in spatial intelligence through its dedicated Spatial Intelligence Lab.[8]
  • Butlr (an MIT Media Lab spin-out) has developed privacy-first thermal-sensing systems that apply AI to understand people’s movements and behaviors in physical spaces for occupancy monitoring, energy optimization, and healthcare applications in buildings.[10]

Relationship to other concepts

Spatial intelligence is often discussed alongside or as complementary to world models, embodied artificial intelligence, and spatial computing. It focuses on the artificial intelligence system’s internal representation and reasoning about space, whereas spatial computing more commonly refers to user-facing interfaces in 3D environments.

See also

References

  1. ^ a b Strickland, Eliza (December 12, 2024). "AI Godmother Fei-Fei Li Has a Vision for Computer Vision". IEEE Spectrum. Retrieved 2026-05-31.
  2. ^ a b "What is Spatial Intelligence?". Stanford Institute for Human-Centered Artificial Intelligence. Retrieved 2026-05-31.
  3. ^ a b "Exclusive: Stanford AI leader Fei-Fei Li building 'spatial intelligence' startup". Reuters. May 3, 2024. Retrieved 2026-05-31.
  4. ^ a b c "Fei-Fei Li of World Labs: AI is incomplete without spatial intelligence". Financial Times. December 16, 2025. Retrieved 2026-05-31.
  5. ^ a b Li, Fei-Fei (December 11, 2025). "Spatial Intelligence Is AI's Next Frontier". TIME. Retrieved 2026-05-31.
  6. ^ Fei-Fei Li (May 2024). "With spatial intelligence, AI will understand the real world". TED. Retrieved 2026-05-31.
  7. ^ "AI pioneer Fei-Fei Li's World Labs raises $1 billion in funding". Yahoo Finance. February 18, 2026. Retrieved 2026-05-31.
  8. ^ a b c "NVIDIA Spatial Intelligence Lab (SIL)". NVIDIA Research. Retrieved 2026-05-31.
  9. ^ a b c d "AI Pioneer Fei-Fei Li Ushers In Next Frontier Of Artificial Intelligence". November 20, 2025. Retrieved 2026-05-31.
  10. ^ "Creating smart buildings with privacy-first sensors". MIT News. February 11, 2025. Retrieved 2026-05-31.

Category:Artificial intelligence Category:Computer vision Category:Robotics

Content Disclaimer

Informasi ini disarikan dari Wikipedia dan disajikan kembali untuk tujuan edukasi. Konten tersedia di bawah lisensi CC BY-SA 3.0. Kami tidak bertanggung jawab atas ketidakakuratan data yang bersumber dari kontribusi publik tersebut.

  1. The information displayed on this website is sourced in part or in whole from Wikipedia and has been adapted for the purpose of restating it. We strive to provide accurate and relevant information, however:
  2. There is no guarantee of absolute accuracy. Wikipedia is an open, collaborative project that can be edited by anyone, so information is subject to change.
  3. It is not intended to constitute professional advice. The content displayed is for informational and educational purposes only. For important decisions (e.g., medical, legal, or financial), please consult a professional.
  4. Content copyright. Wikipedia is licensed under the Creative Commons Attribution-ShareAlike License (CC BY-SA). This means that content may be reused with appropriate attribution and shared under a similar license.
  5. Responsible use. Any risk arising from the use of information from this website is entirely the responsibility of the user.