Blog

Practical articles about building with Anthropic's Claude — from API integration and prompt engineering to workflow automation, tool use, and agentic patterns. Real code, real benchmarks, honest takes.

Frequently asked

About this blog

Quick answers to the questions readers ask most often about how Claudinhos approaches Claude tutorials, comparisons, and production patterns.

01

What kind of articles do you publish?

Hands-on Claude tutorials, model benchmarks (SWE-Bench Verified, HumanEval, Terminal-Bench), head-to-head comparisons against GPT, Gemini, and GitHub Copilot, prompt engineering recipes, MCP integration guides, and production patterns for agentic workflows. Each post ships with code you can adapt to your own stack.

02

How are the benchmarks and comparisons run?

We test on real codebases using identical prompts and identical task definitions across models. When we cite a number, we name the source, the date, and the methodology. When numbers come from third parties, we say so explicitly and link the original report so you can verify or update it yourself.

03

Who is this blog for?

Software engineers integrating Claude into products, startup founders evaluating AI infrastructure, and tech leads standardizing AI tooling across a team. The articles are written for people who need decisions backed by signal — benchmarks, methodology, and operational data — not vibes or marketing.

04

How often are new articles published?

New posts ship regularly across the same themes: model evaluation, prompt engineering, agentic systems, MCP and tool use, workflow automation, and engineering management decisions around AI tooling. Bookmark the page or follow tsunode on social to catch them as they go live.

05

Can I trust the numbers in your comparisons?

We cite SWE-Bench Verified, HumanEval, and other public benchmarks by source with dates and known limitations. When our own session data informs a claim, we say so and disclose sample size — it is our experience, not a controlled study. We never present third-party numbers as if we ran them ourselves.