DA-bench

Visual Benchmark for Data Analytics AI Agents

Question DQ13: Recognizes truly ambiguous queries.

  • Prompt: How many customers churned 2 years ago?
  • Category: Data Querying
  • Datasets: Telco

Latest Results

Tool Score Timestamp Video Recording
unsupervised 5 June 29, 2024 - 9:05 PM
Video Unavailable
databricks 5 June 17, 2024 - 08:20 PM
Video Unavailable
julius 0 June 9, 2024 - 12:55 AM
ChatGPT 0 July 9, 2024 - 07:36 PM
Video Unavailable