LLM as an Operating System ?

LLM as an Operating System?

Since 2023, researchers have been exploring the concept of LLMs functioning as operating system. This analogy makes intutive sense when we consider how traditional operating systems serve as intermediaries between users and computer resources.

I remember encountering visualization that mapped out this transformation. In the traditional OS model, we have layers like the kernel, system calls, and user interface sitting between hardware and applications. With LLM as an OS, we can reimagine these layers, positoning language models and agentic components as the new intermediaries between user and their digital resources - whether that's a data repositories, computational tools, or planning system.

What makes this vision particularly compelling is the role of multimodal interfaces in this "compressed intellgience". Voice and vision capabilities fundamentally reshape how humans interact with thiis "cognitive OS". Instead of typing commands and clicking through menus, uses can engage in natural conversations, share visual information, and receive insights through multiple sensory channels. THis creates more intuitive bridge to both historical archives and real-time data streams, enabling deeper research and more nuanced understanding of complex information landscapes.

The implication extend beyond mere conveneience. This represents a fundamental shift in how we conceptualize the relationship between human cognition and digitial systems, moving from explicit command-based interactions to more fluid, context-aware exchanges.

Research Papers:

MemGPT: Towards LLMs as Operating System (2023)

LLM as OS, Agents as Apps: Envisioning AIOS, Agents and the AIOS-Agent Ecosystem (2023)

AIOS: LLM Agent Operating System (2024)