LLMs

498 bookmarks

Custom sorting

AI-Infused Development Needs More Than Prompts

Why intent and control are becoming the new software architecture

·oreilly.com·Apr 11, 2026

AI-Infused Development Needs More Than Prompts

Your Agent Is 80% Plumbing. Here Are the 12 Pieces You're Missing.

Watch now | Everyone’s talking about the Tamagotchi -- here’s what actually matters.

·natesnewsletter.substack.com·Apr 9, 2026

Your Agent Is 80% Plumbing. Here Are the 12 Pieces You're Missing.

Who’s the Admin, Me or Claude?

Credit: Museums Victoria / Unsplash There’s a lot of conversation right now about “context engineering” for dev work; structuring what you feed an LLM so it can do useful things. …

·cate.blog·Apr 7, 2026

Who’s the Admin, Me or Claude?

Mastering Caching Methods in Large Language Models (LLMs)

Large Language Models (LLMs) like OpenAI’s GPT-4 have transformed natural language processing, enabling applications ranging from chatbots…

·masteringllm.medium.com·Apr 6, 2026

Mastering Caching Methods in Large Language Models (LLMs)

How to Implement Effective LLM Caching

A deep dive into effective caching strategies for building scalable and cost-efficient LLM applications, covering exact key vs. semantic caching, architectural patterns, and practical implementation tips.

·helicone.ai·Apr 6, 2026

How to Implement Effective LLM Caching

Build an Inference Cache to Save Costs in High-Traffic LLM Apps - MachineLearningMastery.com

In this article, you will learn how to add both exact-match and semantic inference caching to large language model applications to reduce latency and API costs at scale.

·machinelearningmastery.com·Apr 6, 2026

Build an Inference Cache to Save Costs in High-Traffic LLM Apps - MachineLearningMastery.com

How Vision Language Models Are Trained from “Scratch” | Towards Data Science

A deep dive into exactly how text-only language models are finetuned to *see* images

·towardsdatascience.com·Apr 6, 2026

How Vision Language Models Are Trained from “Scratch” | Towards Data Science

I Replaced Vector DBs with Google’s Memory Agent Pattern for my notes in Obsidian | Towards Data Science

Persistent AI memory without embeddings, Pinecone, or a PhD in similarity search.

·towardsdatascience.com·Apr 3, 2026

I Replaced Vector DBs with Google’s Memory Agent Pattern for my notes in Obsidian | Towards Data Science

How to Build Custom Tokenizers for Domain-Specific LLMs | Markaicode

Build custom tokenizers for domain-specific LLMs to improve performance. Learn tokenization techniques, training methods, and implementation steps.

·markaicode.com·Apr 2, 2026

How to Build Custom Tokenizers for Domain-Specific LLMs | Markaicode

Anatomy of the .claude/ Folder

A complete guide to CLAUDE.md, custom commands, skills, agents, and permissions, and how to set them up properly.

·blog.dailydoseofds.com·Apr 1, 2026

Anatomy of the .claude/ Folder

We built an org-wide AI agent in 4 days. Here's what broke in the weeks after.

We built a 29K-line org AI agent in 4 days with Codex. Here's what broke after launch: credential leaks, silent event-loop deaths, and a teammate who keeps crashing it.

·daily.dev·Apr 1, 2026

We built an org-wide AI agent in 4 days. Here's what broke in the weeks after.

Fixing Claude with Claude: Anthropic reports on AI site reliability engineering

QCon London A member of Anthropic's AI reliability engineering team spoke at QCon London&n ...

·devclass.com·Mar 29, 2026

Fixing Claude with Claude: Anthropic reports on AI site reliability engineering

Translating non-trivial codebases with Claude

·blog.danieljanus.pl·Mar 29, 2026

Translating non-trivial codebases with Claude

The most expensive coordination cost in product development just got a fix. It's a markdown file.

Watch now | You’ve seen the videos.

·natesnewsletter.substack.com·Mar 29, 2026

The most expensive coordination cost in product development just got a fix. It's a markdown file.

Reaching MLE (machine learning enlightenment)

What is this job about really?

·vickiboykis.com·Mar 29, 2026

Reaching MLE (machine learning enlightenment)

Harper's Policy on Agent PRs

The goal of this page is to formalize my answer so that we can judiciously deal with patch requests produced by LLMs.

·elijahpotter.dev·Mar 29, 2026

Harper's Policy on Agent PRs

How to Build a General-Purpose AI Agent in 131 Lines of Python

Implement a coding agent in 131 lines of Python code, and a search agent in 61 lines

·oreilly.com·Mar 25, 2026

How to Build a General-Purpose AI Agent in 131 Lines of Python

Why AI Coding Agents Waste Half Their Context Window — Stoneforge Blog

AI coding agents burn 20-40% of their context window on codebase exploration. Structured documentation cuts orientation from 20 tool calls to 3.

·stoneforge.ai·Mar 25, 2026

Why AI Coding Agents Waste Half Their Context Window — Stoneforge Blog

The 8 Levels of Agentic Engineering — Bassim Eledath

AI's coding ability is outpacing our ability to wield it effectively. That gap closes in levels — 8 of them. Here's the progression from tab complete to autonomous agent teams.

·bassimeledath.com·Mar 25, 2026

The 8 Levels of Agentic Engineering — Bassim Eledath

Coding Agents Suck at the XY Problem

No longer do we have anyone to question what you’re trying to accomplish.

·bhavesh.dev·Mar 25, 2026

Coding Agents Suck at the XY Problem

Top AI coding tools make mistakes one in four times, study shows

New research from the University of Waterloo shows that artificial intelligence (AI) still struggles with some basic software development tasks, raising questions about how reliably AI systems can assist ...

·techxplore.com·Mar 25, 2026

Top AI coding tools make mistakes one in four times, study shows

Meditation, Language, and LLMs — Roden newsletter issue 112

The resolution of a human is much higher than we think

·craigmod.com·Mar 25, 2026

Meditation, Language, and LLMs — Roden newsletter issue 112

Claude Skill incoming! Generating Postman collections with AI

When speed matters more than perfection, API documentation can quickly become a bottleneck. In this post, I share how we used thoughtbot’s Claude Skill to generate Postman collections directly from a Rails codebase.

·thoughtbot.com·Mar 25, 2026

Claude Skill incoming! Generating Postman collections with AI

I copied a prompt and built a management system in a week

How I used Claude Code to build a GTD-based management system from a borrowed idea, meeting transcripts, and three days of real work.

·thoughtbot.com·Mar 25, 2026

I copied a prompt and built a management system in a week

Cursor Ships Composer 2: Frontier-Level Coding Performance at a Fraction of the Cost - DevOps.com

Cursor Composer 2 delivers frontier coding benchmarks at a lower cost than its predecessor. A new cost-to-intelligence ratio for AI agents.

·devops.com·Mar 25, 2026

Cursor Ships Composer 2: Frontier-Level Coding Performance at a Fraction of the Cost - DevOps.com

How LLMs make Git and GitHub easier to use and learn

I once wrote an article with the optimistic title GitHub for the rest of us. The idea was that everyone who works with others on collections of shared documents needs a powerful and easy way to see…

·blog.jonudell.net·Mar 13, 2026

How LLMs make Git and GitHub easier to use and learn

How to Create a Website Walkthrough Video With AI Talking Avatar

Transform your video storytelling: actionable tips and secrets to create viral TikTok, YouTube Shorts & Instagram Reels effortlessly.

·revid.ai·Mar 13, 2026

How to Create a Website Walkthrough Video With AI Talking Avatar

The Complete Playwright End-to-End Story, Tools, AI, and Real-World Workflows - Microsoft for Developers

1. Introduction End-to-end testing has evolved dramatically, and Playwright stands at the forefront. Playwright offers a full ecosystem empowering developers to write, debug, and maintain tests with speed and reliability. From its powerful test runner to rich developer tools like the VS Code extension, Codegen, UI Mode, and Trace Viewer, Playwright covers every phase of […]

·developer.microsoft.com·Mar 13, 2026

The Complete Playwright End-to-End Story, Tools, AI, and Real-World Workflows - Microsoft for Developers

Building Smart Web Automation Bots with Playwright and OpenAI

Building Smart Web Automation Bots with Playwright and OpenAI API A practical guide to...

·dev.to·Mar 13, 2026

Building Smart Web Automation Bots with Playwright and OpenAI

Web Automation Using AI: A Practical Guide with Playwright, GitHub Copilot, and MCP

If you’ve spent any decent amount of time writing browser tests, you’ll know what I mean when I say — it can be painfully repetitive. I’ve…

·codestax.medium.com·Mar 13, 2026

Web Automation Using AI: A Practical Guide with Playwright, GitHub Copilot, and MCP