Which AI model shall we chose?

Published

March 6, 2026

In what follows, here is my personal take as of date:2026-03-06.

Basics

Generative AI based on Large Language Models (genAI) is a powerful tool with a series of decisions by users. This is my guide with a focus on task related to analyzing data.

Leading models

  • As of now the leading (state of the art, SOTA) models are Google’s Gemini (3.0), OpenAI’s ChatGPT (5.4), and Anthropic’s Claude Opus (4.6.)
  • Other key model is Mistral AI but there are several others
  • Open source – open weights models are also quite good, Kimi (K2.5), an open weight model, is quite good and free. Deepseek, Qwen the other leading models. Think of them as SOTA 4-9 months ago

Thinking

  • Each model have faster (cheaper) and thinking (more expensive) variants, such as Sonnet and Opus for Claude, Thinking and Instant for ChatGPT. In addition you can have extended thinking
  • For data analysis work, the thinking variants are recommended.
  • All major models have deep research

Tools

  • All major models support tool use (e.g. web browsing, code execution) and agentic patterns (multi-step workflows with memory and tool use).
  • Currently Gemini is the weakest here

Use cases

AI is great for many tasks. In this course we only focus on aspects of Data Analysis:

  • designing analysis
  • writing code
  • data wrangling such as joining tables, sample design, variable transformations
  • exploratory data analysis
  • modelling, regressions, machine learning
  • causal inference
  • creating tables and graphs
  • writing reports and presentations

Models for data analysis

For data-analysis projects, my current recommended approach is: paid Claude.

Research & Writing

  • Synthesis Over Summarization: AI tools increasingly synthesize multi-source inputs into structured insights rather than paraphrasing single documents.
  • Security & Privacy: Modern workspaces rely on isolated execution contexts; strong non-training guarantees apply primarily to paid and enterprise tiers.
  • Multimodal Capability: AI can interpret charts, screenshots, and handwritten notes and incorporate them into drafts.

For data analysis workspaces – comparison

Last update: 16 Jan 2026

Workspace 2026 Key Features
Anthropic Claude Artifacts • Creates interactive applications (tutors, calculators) within the output window.
• Real-time iteration on complex document structures.
OpenAI ChatGPT Canvas • Advanced frontier models with contextual persistence for tone and style.
• Inline editing with granular control over specific sections.
Google NotebookLM Interactive Audio Overviews with user interruption and questioning.
• Grounded citations linked directly to uploaded source segments.
Perplexity Pages • Multi-source synthesis using live web retrieval.
• Inline citation and consistency checking against sources.

Data Analysis details

  • Sandboxed Execution: Code runs in secure, ephemeral environments with no local system access.
  • Statistical Rigor: Strong support for Python-based libraries (e.g. pandas, scikit-learn) for exploratory and predictive analysis.
  • Direct Integration: AI can manipulate data directly within spreadsheets or dedicated analysis windows.
  • Limits: Reproducibility, package versions, and state persistence remain constrained relative to local workflows.

Data Analysis details

Workspace 2026 Key Features for Analysis
ChatGPT Data Analysis • Executes Python in managed compute environments for multi-file datasets.
• Assisted data cleaning and predictive modeling workflows.
Claude Analysis • High-fidelity SVG and lightweight interactive output.
• Fast iteration on statistical tables with publication-ready formatting.
Google Gemini in Sheets Multimodal cleaning: converts screenshots or PDFs into structured tables.
• Natural-language formula generation and transformations.
Microsoft Copilot in Excel Native Python-in-Excel for statistical scripts inside spreadsheets.
• Automated pivots, summaries, and forecasting via prompts.

Coding Assistance

  • Agent-Assisted Workflows: AI can coordinate multi-step tasks such as refactoring or bug fixing across large codebases, with human review.
  • Environment Security: Code is tested in secure sandboxes before changes are proposed.
  • Interconnected Tools: Integrated with development and collaboration platforms (e.g. Jira, Slack).

Coding Assistance — comparison

Workspace 2026 Key Features for Developers
Anthropic Claude Code • Native VS Code extension with agent-style workflows and inline diffs.
• Supports complex multi-file edits and testing assistance.
GitHub Copilot • Uses Extensions to interact with external dev tools (Azure, Slack, Jira).
• Deep context from local and remote repositories.
Cursor • AI-first editor with awareness of project-wide dependencies.
• Strong support for iterative refactors across files.
Windsurf (Codeium) Cascade mode for orchestrating large-scale refactors.
• Robust free tier for students and individual users.

Security note: SOC2 compliance is common; strict zero-retention guarantees typically apply to enterprise or explicitly configured accounts.

What changed from 2025 Q2

  • No need to think much re which models to use.
  • Leading models similar capability, but different. Not really sure how…

Feedback

Dear Reader. I have limited experience. Suggestions are welcome, please post an issue.