Creating an AI Observer: Generative Semantic Workspaces

An experienced human Observer reading a document -- such as a crime report -- creates a succinct plot-like comprising different actors, their prototypical roles and states at any point, their evolution over time based on their interactions, and even a map of missing Semantic parts anticipating them in the future. . We introduce the enerative emantic orkspace (GSW) -- comprising an and a -- that leverages advancements in LLMs to create a generative-style Semantic framework, as opposed to a traditionally predefined set of lexicon labels. Given a text segment that describes an ongoing situation, the instantiates actor-centric Semantic maps (termed ``Workspace instance'' ). The resolves differences between and a ``Working memory'' to generate the updated . GSW outperforms well-known baselines on several tasks ( vs. FST, GLEN, BertSRL - multi-sentence Semantics extraction, vs. NLI-BERT, vs. QA). By mirroring the real Observer, GSW provides the first step towards Spatial Computing assistants capable of understanding individual intentions and predicting future behavior.
View on arXiv