AI Data Analysis Prompts & Techniques
Data analysis is not just about running statistical tests. It is about asking the right questions, interpreting results correctly, and communicating findings in a way that drives decisions. AI accelerates every stage of this pipeline — from data cleaning to presentation — when prompted with the right structure.
That said, AI-assisted data analysis carries unique risks. A model that produces beautiful charts from garbage data is more dangerous than no analysis at all. The prompts in this guide are designed to produce not just output, but output you can trust.
This guide covers data analysis prompting for analysts, product managers, marketers, and anyone who makes decisions based on data. Every prompt was tested with both ChatGPT and Claude using real datasets.
When AI Works for Data Analysis — and When It Does Not
| Task | AI Suitability | Why |
|---|---|---|
| Data cleaning and formatting | High | Pattern recognition at scale |
| Exploratory analysis suggestions | High | Broad domain knowledge |
| Anomaly detection | Medium | Good at flagging outliers, bad at explaining why |
| Statistical calculations | Medium | Always verify math independently |
| Causal inference | Low | Cannot establish causation from observational data |
| Ethical interpretation | Low | Requires human judgment on bias and fairness |
Prompt 1: Exploratory Data Analysis
Start broad, then narrow down based on what the data tells you.
I have a dataset with these columns:
[COLUMN NAMES AND TYPES]
The dataset has [X] rows and represents [WHAT THE DATA IS ABOUT].
Please suggest an exploratory data analysis plan:
1. What summary statistics should I calculate first?
2. What distributions should I visualize?
3. What correlations or relationships are worth investigating?
4. What data quality issues should I check for (missing values, outliers, duplicates)?
5. What hypotheses emerge from a first look at the data?
Please provide Python or R code for each step.
Prompt 2: Insight Extraction
After exploring the data, extract actionable insights for stakeholders.
Here are the key findings from my analysis of [TOPIC]:
[SUMMARIZE KEY STATISTICS, CHARTS, OR TESTS]
I need to present this to [AUDIENCE — e.g., executives, product team, marketing].
Please:
1. Identify the 3 most important insights
2. Frame each insight as a business implication, not a statistic
3. Suggest one specific action per insight
4. Flag any findings where correlation might not equal causation
5. Recommend what additional data would strengthen the analysis
Prompt 3: Anomaly Detection
I have time-series data for [METRIC] over [TIME PERIOD].
DATA SAMPLE:
[PASTE SAMPLE OR SUMMARY]
Please:
1. Identify any anomalies, spikes, or drops that deviate from the normal pattern
2. For each anomaly, suggest the most likely causes
3. Recommend whether each anomaly requires investigation or is likely noise
4. Suggest a statistical method to automate anomaly detection going forward
Prompt 4: A/B Test Analysis
I ran an A/B test with these parameters:
- Control group: [N] users, conversion rate [X]%
- Treatment group: [N] users, conversion rate [Y]%
- Test duration: [DAYS]
- Primary metric: [METRIC]
- Significance level: [ALPHA, e.g., 0.05]
Please:
1. Calculate whether the result is statistically significant
2. Calculate confidence intervals for the lift
3. Assess practical significance (is the lift meaningful to the business?)
4. Identify potential confounding factors (seasonality, novelty effect, sample bias)
5. Recommend whether to roll out, iterate, or run a follow-up test
Prompt 5: Executive Summary
Data is only valuable if decision-makers act on it. Turn analysis into a concise executive summary.
Turn this analysis into an executive summary for [STAKEHOLDER].
ANALYSIS:
[PASTE FULL ANALYSIS]
Constraints:
- Max 250 words
- Lead with the most important recommendation
- Include one key number that should be remembered
- No jargon without explanation
- One risk or caveat must be mentioned
- End with a clear decision point
Prompting for Different Audiences
The same data tells different stories depending on who is listening. AI can help you tailor the narrative.
Here are the key findings from my A/B test analysis:
- Control conversion rate: 3.2%
- Treatment conversion rate: 3.8%
- Lift: +18.75%
- Statistical significance: p = 0.042 (significant at α = 0.05)
- Sample size: 5,000 per group
- Test duration: 14 days
AUDIENCE: Executive leadership (C-suite)
Please write a summary that:
1. Leads with the business impact (revenue implication, not p-values)
2. States the recommendation clearly in the first sentence
3. Mentions the confidence level without jargon
4. Notes any caveats in one sentence
5. Ends with the next step (roll out, run longer, or iterate)
Max 150 words. No statistical jargon.
Here are the key findings from my A/B test analysis:
[Same data as above]
AUDIENCE: Data science peer review
Please write a technical summary that:
1. Reports effect size and confidence interval, not just p-value
2. Discusses potential confounding factors (seasonality, novelty effect, selection bias)
3. Suggests follow-up analyses that would strengthen the conclusion
4. Evaluates practical significance (is an 18% lift meaningful given costs?)
5. Cites the methodology (e.g., two-proportion z-test with continuity correction)
Format as bullet points. Be precise about assumptions and limitations.
Data Analysis Ethics
- Do not overclaim: AI loves to find patterns. Ask it explicitly about confounding variables and alternative explanations.
- Check the math: Always verify statistical calculations independently before presenting them.
- Context matters: A 5% conversion lift is meaningless without knowing baseline, sample size, and cost.
- Watch for p-hacking: If you run 20 tests, one will show significance by chance. Ask the AI to correct for multiple comparisons.
- Sample bias: AI often assumes data is representative. If your sample is biased (self-selected, geographically limited, time-bound), the analysis is wrong no matter how sophisticated.
Common Data Analysis Mistakes When Prompting AI
- Not providing data context: A prompt that says "analyze this data" without describing what the columns mean produces generic, unhelpful output.
- Forgetting to ask for visualizations: AI defaults to text summaries. Explicitly request charts, tables, or specific plot types.
- Assuming AI knows your industry: "Analyze this sales data" is different for SaaS vs. e-commerce vs. B2B services. Specify.
- Not asking about limitations: The most valuable part of an analysis is often what is NOT in the data. Ask explicitly.
Next Steps
Expand your analytical toolkit with our ChatGPT Tips and Claude Tips guides, which include platform-specific features for working with data.