Intermediate

AI Data Analysis Prompts & Techniques

Data analysis is not just about running statistical tests. It is about asking the right questions, interpreting results correctly, and communicating findings in a way that drives decisions. AI accelerates every stage of this pipeline — from data cleaning to presentation — when prompted with the right structure.

That said, AI-assisted data analysis carries unique risks. A model that produces beautiful charts from garbage data is more dangerous than no analysis at all. The prompts in this guide are designed to produce not just output, but output you can trust.

This guide covers data analysis prompting for analysts, product managers, marketers, and anyone who makes decisions based on data. Every prompt was tested with both ChatGPT and Claude using real datasets.

When AI Works for Data Analysis — and When It Does Not

Task AI Suitability Why
Data cleaning and formattingHighPattern recognition at scale
Exploratory analysis suggestionsHighBroad domain knowledge
Anomaly detectionMediumGood at flagging outliers, bad at explaining why
Statistical calculationsMediumAlways verify math independently
Causal inferenceLowCannot establish causation from observational data
Ethical interpretationLowRequires human judgment on bias and fairness

Prompt 1: Exploratory Data Analysis

Start broad, then narrow down based on what the data tells you.

I have a dataset with these columns:
[COLUMN NAMES AND TYPES]

The dataset has [X] rows and represents [WHAT THE DATA IS ABOUT].

Please suggest an exploratory data analysis plan:
1. What summary statistics should I calculate first?
2. What distributions should I visualize?
3. What correlations or relationships are worth investigating?
4. What data quality issues should I check for (missing values, outliers, duplicates)?
5. What hypotheses emerge from a first look at the data?

Please provide Python or R code for each step.

Prompt 2: Insight Extraction

After exploring the data, extract actionable insights for stakeholders.

Here are the key findings from my analysis of [TOPIC]:

[SUMMARIZE KEY STATISTICS, CHARTS, OR TESTS]

I need to present this to [AUDIENCE — e.g., executives, product team, marketing].

Please:
1. Identify the 3 most important insights
2. Frame each insight as a business implication, not a statistic
3. Suggest one specific action per insight
4. Flag any findings where correlation might not equal causation
5. Recommend what additional data would strengthen the analysis

Prompt 3: Anomaly Detection

I have time-series data for [METRIC] over [TIME PERIOD].

DATA SAMPLE:
[PASTE SAMPLE OR SUMMARY]

Please:
1. Identify any anomalies, spikes, or drops that deviate from the normal pattern
2. For each anomaly, suggest the most likely causes
3. Recommend whether each anomaly requires investigation or is likely noise
4. Suggest a statistical method to automate anomaly detection going forward

Prompt 4: A/B Test Analysis

I ran an A/B test with these parameters:

- Control group: [N] users, conversion rate [X]%
- Treatment group: [N] users, conversion rate [Y]%
- Test duration: [DAYS]
- Primary metric: [METRIC]
- Significance level: [ALPHA, e.g., 0.05]

Please:
1. Calculate whether the result is statistically significant
2. Calculate confidence intervals for the lift
3. Assess practical significance (is the lift meaningful to the business?)
4. Identify potential confounding factors (seasonality, novelty effect, sample bias)
5. Recommend whether to roll out, iterate, or run a follow-up test

Prompt 5: Executive Summary

Data is only valuable if decision-makers act on it. Turn analysis into a concise executive summary.

Turn this analysis into an executive summary for [STAKEHOLDER].

ANALYSIS:
[PASTE FULL ANALYSIS]

Constraints:
- Max 250 words
- Lead with the most important recommendation
- Include one key number that should be remembered
- No jargon without explanation
- One risk or caveat must be mentioned
- End with a clear decision point

Prompting for Different Audiences

The same data tells different stories depending on who is listening. AI can help you tailor the narrative.

Here are the key findings from my A/B test analysis:

- Control conversion rate: 3.2%
- Treatment conversion rate: 3.8%
- Lift: +18.75%
- Statistical significance: p = 0.042 (significant at α = 0.05)
- Sample size: 5,000 per group
- Test duration: 14 days

AUDIENCE: Executive leadership (C-suite)

Please write a summary that:
1. Leads with the business impact (revenue implication, not p-values)
2. States the recommendation clearly in the first sentence
3. Mentions the confidence level without jargon
4. Notes any caveats in one sentence
5. Ends with the next step (roll out, run longer, or iterate)

Max 150 words. No statistical jargon.
Here are the key findings from my A/B test analysis:

[Same data as above]

AUDIENCE: Data science peer review

Please write a technical summary that:
1. Reports effect size and confidence interval, not just p-value
2. Discusses potential confounding factors (seasonality, novelty effect, selection bias)
3. Suggests follow-up analyses that would strengthen the conclusion
4. Evaluates practical significance (is an 18% lift meaningful given costs?)
5. Cites the methodology (e.g., two-proportion z-test with continuity correction)

Format as bullet points. Be precise about assumptions and limitations.

Data Analysis Ethics

  • Do not overclaim: AI loves to find patterns. Ask it explicitly about confounding variables and alternative explanations.
  • Check the math: Always verify statistical calculations independently before presenting them.
  • Context matters: A 5% conversion lift is meaningless without knowing baseline, sample size, and cost.
  • Watch for p-hacking: If you run 20 tests, one will show significance by chance. Ask the AI to correct for multiple comparisons.
  • Sample bias: AI often assumes data is representative. If your sample is biased (self-selected, geographically limited, time-bound), the analysis is wrong no matter how sophisticated.

Common Data Analysis Mistakes When Prompting AI

  • Not providing data context: A prompt that says "analyze this data" without describing what the columns mean produces generic, unhelpful output.
  • Forgetting to ask for visualizations: AI defaults to text summaries. Explicitly request charts, tables, or specific plot types.
  • Assuming AI knows your industry: "Analyze this sales data" is different for SaaS vs. e-commerce vs. B2B services. Specify.
  • Not asking about limitations: The most valuable part of an analysis is often what is NOT in the data. Ask explicitly.

Next Steps

Expand your analytical toolkit with our ChatGPT Tips and Claude Tips guides, which include platform-specific features for working with data.

← Customer Support Next: ChatGPT Tips →