AI

AgentArch: A Comprehensive Benchmark to Evaluate Agent Architectures in Enterprise

Source: arXiv AI Papers
This study offers a comprehensive benchmark for evaluating 18 distinct agentic configurations within advanced large language models. Key dimensions assessed…
Read more: AgentArch: A Comprehensive Benchmark to Evaluate Agent Architectures in Enterprise
LLM Enhancement with Domain Expert Mental Model to Reduce LLM Hallucination with Causal Prompt Engineering

Source: arXiv AI Papers
This paper discusses the application of large language models (LLMs) and Retrieval-Augmented Generation (RAG) for improving decision-making processes. It addresses…
Read more: LLM Enhancement with Domain Expert Mental Model to Reduce LLM Hallucination with Causal Prompt Engineering
Is the `Agent’ Paradigm a Limiting Framework for Next-Generation Intelligent Systems?

Source: arXiv AI Papers
This paper critiques the agent-centric paradigm in AI, discussing its limitations and biases. It suggests a shift towards system-level dynamics…
Read more: Is the `Agent’ Paradigm a Limiting Framework for Next-Generation Intelligent Systems?
Enhancing Computational Cognitive Architectures with LLMs: A Case Study

Source: arXiv AI Papers
This article discusses the integration of large language models (LLMs) with computational cognitive architectures, specifically the Clarion architecture. The research…
Read more: Enhancing Computational Cognitive Architectures with LLMs: A Case Study
Rethinking Human Preference Evaluation of LLM Rationales

Source: arXiv AI Papers
This study investigates the evaluation of rationales generated by large language models (LLMs). It proposes an attribute-based approach for assessing…
Read more: Rethinking Human Preference Evaluation of LLM Rationales
Free-MAD: Consensus-Free Multi-Agent Debate

Source: arXiv AI Papers
Free-MAD is a new framework designed to enhance the reasoning abilities of large language models by eliminating the need for…
Read more: Free-MAD: Consensus-Free Multi-Agent Debate
Tractable Asymmetric Verification for Large Language Models via Deterministic Replicability

Source: arXiv AI Papers
This paper presents a novel verification framework to address trust issues in multi-agent systems utilizing Large Language Models (LLMs). It…
Read more: Tractable Asymmetric Verification for Large Language Models via Deterministic Replicability
Difficulty-Aware Agent Orchestration in LLM-Powered Workflows

Source: arXiv AI Papers
A new framework, Difficulty-Aware Agentic Orchestration (DAAO), aims to enhance the efficiency of multi-agent systems using Large Language Models (LLMs).…
Read more: Difficulty-Aware Agent Orchestration in LLM-Powered Workflows
Neural cellular automata: applications to biology and beyond classical AI

Source: arXiv AI Papers
Neural Cellular Automata (NCA) combine traditional rule-based systems with trainable neural networks to model biological self-organization. This framework has applications…
Read more: Neural cellular automata: applications to biology and beyond classical AI
AlignKT: Explicitly Modeling Knowledge State for Knowledge Tracing with Ideal State Alignment

Source: arXiv AI Papers
AlignKT proposes an innovative approach to knowledge tracing in intelligent tutoring systems by explicitly modeling a stable knowledge state. It…
Read more: AlignKT: Explicitly Modeling Knowledge State for Knowledge Tracing with Ideal State Alignment