DA-bench

Visual Benchmark for Data Analytics AI Agents

Question DQ23: Can handle typos well

  • Prompt: What is the average of the "tenure_months" column of telco_customer_billing?
  • Category: Data Querying
  • Datasets: Telco

Latest Results

Tool Score Timestamp Video Recording
unsupervised 5 June 28, 2024 - 11:10AM
databricks 3 July 4, 2024 - 10:28 AM
ChatGPT 0 July 10, 2024 - 11:05 AM
Video Unavailable