23
0

Creating an AI Observer: Generative Semantic Workspaces

Abstract

An experienced human Observer reading a document -- such as a crime report -- creates a succinct plot-like “Working Memory”\textit{``Working Memory''} comprising different actors, their prototypical roles and states at any point, their evolution over time based on their interactions, and even a map of missing Semantic parts anticipating them in the future. An equivalent AI Observer currently does not exist\textit{An equivalent AI Observer currently does not exist}. We introduce the [G]\textbf{[G]}enerative [S]\textbf{[S]}emantic [W]\textbf{[W]}orkspace (GSW) -- comprising an “Operator”\textit{``Operator''} and a “Reconciler”\textit{``Reconciler''} -- that leverages advancements in LLMs to create a generative-style Semantic framework, as opposed to a traditionally predefined set of lexicon labels. Given a text segment CnC_n that describes an ongoing situation, the Operator\textit{Operator} instantiates actor-centric Semantic maps (termed ``Workspace instance'' Wn\mathcal{W}_n). The Reconciler\textit{Reconciler} resolves differences between Wn\mathcal{W}_n and a ``Working memory'' Mn\mathcal{M}_n^* to generate the updated Mn+1\mathcal{M}_{n+1}^*. GSW outperforms well-known baselines on several tasks (94%\sim 94\% vs. FST, GLEN, BertSRL - multi-sentence Semantics extraction, 15%\sim 15\% vs. NLI-BERT, 35%\sim 35\% vs. QA). By mirroring the real Observer, GSW provides the first step towards Spatial Computing assistants capable of understanding individual intentions and predicting future behavior.

View on arXiv
Comments on this paper