DiCoRe, a new reasoning framework, enhances zero-shot event detection in text by decoupling event discovery and alignment using divergent and convergent reasoning stages, validated by an LLM-Judge. Experiments across six datasets and nine LLMs demonstrate consistent performance gains of 4-7% over existing methods.
The automated identification of events within unstructured text – a core challenge in natural language processing – receives attention from researchers seeking to improve performance without reliance on labelled training data. This is particularly relevant for specialised fields where such data is scarce. A team comprising Tanmay Parekh, Kai-Wei Chang and Nanyun Peng from the University of California, Los Angeles, alongside Kartik Mehta and Ninareh Mehrabi from Amazon AGI Foundations, detail their approach in a paper entitled ‘DiCoRe: Enhancing Zero-shot Event Detection via Divergent-Convergent LLM Reasoning’. Their work introduces a novel framework designed to decouple event detection using a two-stage process of divergent and convergent reasoning, coupled with a verification stage, to improve the accuracy of large language models (LLMs) in this task.
Novel Reasoning Framework Advances Zero-Shot Event Detection
Researchers at Kairos AI have developed DiCoRe, a divergent-convergent reasoning framework, which consistently surpasses existing methods in zero-shot Event Detection (ED). This capability – identifying event mentions within text without prior training on specific event types – represents a significant advance in natural language processing. The team addressed inherent limitations in applying large language models (LLMs) directly to this complex task, achieving improvements in both event coverage and precision.
DiCoRe decouples ED into distinct stages. It begins with a ‘Dreamer’ component that generates a broad range of potential events through open-ended discovery, expanding the scope of possible identifications. This is followed by a ‘Grounder’ component, which aligns these predictions with task-specific instructions. The Grounder utilises constrained decoding, guided by a finite-state machine – a computational model with a finite number of states and transitions – to focus the model’s output and ensure relevance to the defined event ontology (a structured representation of knowledge). Finally, an LLM-based ‘Judge’ component verifies the final predictions, refining accuracy and providing insights into the model’s reasoning.
Quantitative results, presented across six datasets spanning five domains, demonstrate DiCoRe’s superior performance. The framework consistently achieved average F1 gains of 4-7% over the strongest baseline models. The F1-score is a measure of a test’s accuracy, calculated as the harmonic mean of precision and recall. This robustness and adaptability establish DiCoRe as a reliable solution for zero-shot event detection.
Qualitative analysis reveals the value of DiCoRe’s component-wise approach. Researchers can gain insight into how the model arrives at its conclusions and identify strengths and weaknesses. By breaking down the reasoning process into Dreamer, Grounder, and Judge stages, the team can pinpoint areas for refinement and develop strategies for improving performance. This transparency is crucial for building trust in the system and ensuring its reliability in real-world applications.
Examination of component contributions, facilitated by the ‘Judge’ component, reveals how the interplay between Dreamer and Grounder drives accurate results and provides valuable insights into the model’s reasoning process. Instances of error highlight areas where nuanced language or additional contextual information pose challenges, guiding future research and development efforts.
Despite strong overall performance, limitations remain. The model occasionally misidentifies event types or fails to capture all relevant arguments, suggesting a need for improved handling of complex linguistic structures and contextual dependencies. Further refinement of the finite-state machine guiding the Grounder component could also enhance alignment with task-specific instructions and improve the model’s ability to accurately identify and classify events.
Future work will investigate incorporating external knowledge sources to augment contextual understanding and improve the model’s ability to accurately identify and classify events. Researchers plan to explore the use of knowledge graphs and other external resources to provide additional context and information and enhance the model’s reasoning capabilities. They also intend to investigate new techniques for improving the finite-state machine guiding the Grounder component.
The team anticipates that these future research efforts will lead to further improvements in DiCoRe’s performance and enable new applications in areas such as knowledge discovery and decision support. DiCoRe’s demonstrated ability to accurately identify events across diverse domains suggests its potential application in areas such as news analysis, security monitoring, and medical information extraction, offering significant benefits to a wide range of industries.
👉 More information
🗞 DiCoRe: Enhancing Zero-shot Event Detection via Divergent-Convergent LLM Reasoning
🧠 DOI: https://doi.org/10.48550/arXiv.2506.05128
