Page Gallery

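340 illustrated pages collected from the Anthropic sitemap. The images fall into two types: custom illustrations hosted on cdn.sanity.io, and symbol illustrations generated by the /api/opengraph-illustration?name=… endpoint.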
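As a rough illustration of the two image types, the sketch below sorts an image URL into one of them. It assumes only what is stated above (the cdn.sanity.io host and the /api/opengraph-illustration path); the function name, the labels, and the example URLs are hypothetical and not taken from the site.

```python
from urllib.parse import urlparse

# Hypothetical labels for the two image types described above.
CUSTOM = "custom"   # custom illustration hosted on cdn.sanity.io
SYMBOL = "symbol"   # symbol illustration generated by the opengraph endpoint
OTHER = "other"     # anything that matches neither pattern


def classify_image(url: str) -> str:
    """Classify an image URL by host or path (a sketch, not the site's own logic)."""
    parts = urlparse(url)
    if parts.hostname == "cdn.sanity.io":
        return CUSTOM
    if parts.path == "/api/opengraph-illustration":
        return SYMBOL
    return OTHER


if __name__ == "__main__":
    # Example URLs are made up for illustration only.
    print(classify_image("https://cdn.sanity.io/images/example.png"))
    print(classify_image("/api/opengraph-illustration?name=example"))
```

Matching on the parsed host and path, rather than on a substring of the whole URL, avoids misclassifying a URL that merely mentions cdn.sanity.io in its query string.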

340 in total

News & Announcements (203)

Introducing 100K Context Windows

· /news/100k-context-windows

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

A new initiative for developing third-party model evaluations

· /news/a-new-initiative-for-developing-third-party-model-evaluations

A robust, third-party evaluation ecosystem is essential for assessing AI capabilities and risks, but the current evaluations landscape is limited. Developing high-quality, safety-r

Activating AI Safety Level 3 protections

· /news/activating-asl3-protections

We have activated the AI Safety Level 3 (ASL-3) Deployment and Security Standards described in Anthropic’s Responsible Scaling Policy (RSP) in conjunction with launching Claude Opu

Advancing Claude for Education

· /news/advancing-claude-for-education

A first look at new education-specific integrations, expanded student programs, and university updates.

Advancing Claude for Financial Services

· /news/advancing-claude-for-financial-services

Claude for Financial Services now supports a native Excel plug-in, new connectors to real-time market data, and pre-built skills for modeling, comp analysis, and earnings reports.

Announcing our updated Responsible Scaling Policy

· /news/announcing-our-updated-responsible-scaling-policy

Today we are publishing a significant update to our Responsible Scaling Policy (RSP), the risk governance framework we use to mitigate potential catastrophic risks from frontier AI

Anthropic achieves ISO 42001 certification for responsible AI

· /news/anthropic-achieves-iso-42001-certification-for-responsible-ai

We are excited to announce that Anthropic has achieved accredited certification under the new ISO/IEC 42001:2023 standard for our AI management system. ISO 42001 is the first inter

Anthropic awarded $200M DOD agreement for AI capabilities

· /news/anthropic-and-the-department-of-defense-to-advance-responsible-ai-in-defense-operations

The U.S. Department of Defense (DOD), through its Chief Digital and Artificial Intelligence Office (CDAO), has awarded Anthropic a two-year prototype other transaction agreement wi

Anthropic partners with BCG

· /news/anthropic-bcg

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic Education Report: How University Students Use Claude

· /news/anthropic-education-report-how-university-students-use-claude

AI systems are no longer just specialized research tools: they’re everyday academic companions. As AIs integrate more deeply into educational environments, we need to consider impo

Anthropic is endorsing SB 53

· /news/anthropic-is-endorsing-sb-53

Anthropic is endorsing SB 53, the California bill that governs powerful AI systems built by frontier AI developers like Anthropic.

Anthropic Partners with Google Cloud

· /news/anthropic-partners-with-google-cloud

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic raises Series E at $61.5B post-money valuation

· /news/anthropic-raises-series-e-at-usd61-5b-post-money-valuation

Anthropic has raised $3.5 billion at a $61.5 billion post-money valuation. The round was led by Lightspeed Venture Partners, with participation from Bessemer Venture Partners, Cisc

Anthropic raises $13B Series F at $183B post-money valuation

· /news/anthropic-raises-series-f-at-usd183b-post-money-valuation

Anthropic has completed a Series F fundraising of $13 billion led by ICONIQ. This financing values Anthropic at $183 billion post-money. Along with ICONIQ, the round was co-led by

Anthropic joins White House pledge for AI education

· /news/anthropic-signs-pledge-to-americas-youth-investing-in-ai-education

Anthropic signs White House pledge investing in AI education for America's youth, supporting AI students and educators nationwide

Anthropic's Responsible Scaling Policy

· /news/anthropics-responsible-scaling-policy

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Building safeguards for Claude

· /news/building-safeguards-for-claude

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Challenges in Red Teaming AI Systems

· /news/challenges-in-red-teaming-ai-systems

In this post we detail insights from a sample of red teaming approaches that we’ve used to test our AI systems. Through this practice, we’ve begun to gather empirical data about th

Charting a Path to AI Accountability

· /news/charting-a-path-to-ai-accountability

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Aligning on child safety principles

· /news/child-safety-principles

Alongside other leading AI companies, we’re committed to implementing robust child safety measures in the development, deployment, and maintenance of generative AI technologies.

Introducing Claude 2.1

· /news/claude-2-1

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Claude 2

· /news/claude-2

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Introducing Claude 3.5 Sonnet

· /news/claude-3-5-sonnet

Introducing Claude 3.5 Sonnet—our most intelligent model yet. Sonnet now outperforms competitor models and Claude 3 Opus on key evaluations, at twice the speed.

Claude 3.7 Sonnet and Claude Code

· /news/claude-3-7-sonnet

Today, we’re announcing Claude 3.7 Sonnet, our most intelligent model to date and the first hybrid reasoning model generally available on the market.

Introducing the next generation of Claude

· /news/claude-3-family

Today, we're announcing the Claude 3 model family, which sets new industry benchmarks across a wide range of cognitive tasks. The family includes three state-of-the-art models in a

Claude 3 Haiku: our fastest model yet

· /news/claude-3-haiku

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Introducing Claude 4

· /news/claude-4

Discover Claude 4's breakthrough AI capabilities. Experience more reliable, interpretable assistance for complex tasks across work and learning.

Claude and Alexa+

· /news/claude-and-alexa-plus

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Claude is now available in Brazil

· /news/claude-brazil

Claude, Anthropic’s trusted AI assistant, is now available in Brazil. Starting today, consumers and businesses in Brazil will be able to access Claude.

Claude Code and new admin controls for business plans

· /news/claude-code-on-team-and-enterprise

Enterprise and Team customers can now upgrade to premium seats that include more usage and Claude Code—bringing our app and powerful coding agent together under one subscription.

Introducing Claude Design by Anthropic Labs

· /news/claude-design-anthropic-labs

Today, we’re launching Claude Design, a new Anthropic Labs product that lets you collaborate with Claude to create polished visual work like designs, prototypes, slides, one-pagers

Claude is now available in the EU

· /news/claude-europe

We’re excited to announce that Claude, Anthropic’s trusted AI assistant, is now available for people and businesses across Europe to enhance their productivity and creativity.

Claude for Financial Services

· /news/claude-for-financial-services

Today, we're introducing a comprehensive solution for financial analysis that transforms how finance professionals analyze markets, conduct research, and make investment decisions

Claude for Life Sciences

· /news/claude-for-life-sciences

Discover how Claude accelerates life sciences research with new scientific connectors, skills, and improved performance for drug discovery and clinical work.

Introducing Claude for Nonprofits

· /news/claude-for-nonprofits

Anthropic launches Claude for Nonprofits to help organizations maximize their impact, featuring free AI training and discounted rates for nonprofits.

Claude is now generally available in Xcode

· /news/claude-in-xcode

Connect your Claude account to Xcode 26 for AI-powered coding assistance. Debug, refactor, and build Apple apps faster with Claude Sonnet 4 by Anthropic.

Claude is a space to think

· /news/claude-is-a-space-to-think

We’ve made a choice: Claude will remain ad-free. We explain why advertising incentives are incompatible with a genuinely helpful AI assistant, and how we plan to expand access with

Claude's new constitution

· /news/claude-new-constitution

A new approach to a foundational document that expresses and shapes who Claude is

Claude Opus 4.1

· /news/claude-opus-4-1

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Introducing Claude Opus 4.5

· /news/claude-opus-4-5

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Claude Opus 4.6

· /news/claude-opus-4-6

We’re upgrading our smartest model. Across agentic coding, computer use, tool use, search, and finance, Opus 4.6 is an industry-leading model, often by a wide margin.

Introducing Claude Opus 4.7

· /news/claude-opus-4-7

Our latest model, Claude Opus 4.7, is now generally available. Opus 4.7 is a notable improvement on Opus 4.6 in advanced software engineering, with particular gains on the most dif

Introducing Claude Pro

· /news/claude-pro

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Introducing Claude Sonnet 4.5

· /news/claude-sonnet-4-5

Claude Sonnet 4.5 is the best coding model in the world, strongest model for building complex agents, and best model at using computers.

Introducing Sonnet 4.6

· /news/claude-sonnet-4-6

Claude Sonnet 4.6 is a full upgrade of the model’s skills across coding, computer use, long-reasoning, agent planning, knowledge work, and design.

Claude’s Constitution

· /news/claudes-constitution

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Contextual Retrieval in AI Systems

· /news/contextual-retrieval

Explore how Anthropic enhances AI systems through advanced contextual retrieval methods. Learn about our approach to improving information access and relevance in large language mo

Core Views on AI Safety: When, Why, What, and How

· /news/core-views-on-ai-safety

AI progress may lead to transformative AI systems in the next decade, but we do not yet understand how to make such systems safe and aligned with human values. In response, we are

Anthropic Deloitte Partnership

· /news/deloitte-anthropic-partnership

Deloitte will make Claude available to 470,000 people across its global network. Anthropic's largest enterprise AI deployment to date. Partner with Anthropic because Claude is buil

An update on our election safeguards

· /news/election-safeguards-update

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Enabling Claude Code to work more autonomously

· /news/enabling-claude-code-to-work-more-autonomously

Introducing Claude Code upgrades: native VS Code extension, terminal UX updates, and checkpoints for autonomous development. Handle complex tasks with confidence.

Expanding Access to Claude for Government

· /news/expanding-access-to-claude-for-government

Anthropic's mission is to build reliable, interpretable, steerable AI systems. We have been excited to see our technology used in areas like coding, customer service, drug discover

Frontier Model Security

· /news/frontier-model-security

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Frontier Threats Red Teaming for AI Safety

· /news/frontier-threats-red-teaming-for-ai-safety

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Claude 3.5 Sonnet on GitHub Copilot

· /news/github-copilot

Starting today, the new Claude 3.5 Sonnet begins rolling out on GitHub Copilot, enabling developers to choose Claude 3.5 Sonnet for coding—directly in Visual Studio Code and GitHub

Golden Gate Claude

· /news/golden-gate-claude

When we turn up the strength of the “Golden Gate Bridge” feature, Claude’s responses begin to focus on the Golden Gate Bridge. For a short time, we’re making this model available f

Claude 3 models on Vertex AI

· /news/google-vertex-general-availability

Claude 3 Haiku and Claude 3 Sonnet are now generally available on Google Cloud’s Vertex AI platform.

Advancing Claude in healthcare and the life sciences

· /news/healthcare-life-sciences

Introducing Claude for Healthcare with HIPAA-ready infrastructure, plus expanded Life Sciences tools for clinical trials and regulatory submissions. New connectors to CMS, Medidata

Introducing Labs

· /news/introducing-anthropic-labs

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Introducing Anthropic's Transparency Hub

· /news/introducing-anthropic-transparency-hub

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Introducing Claude to Canada

· /news/introducing-claude-to-canada

Claude is now available in Canada. Starting today, people and businesses across the country will be able to access Claude.

Introducing Claude

· /news/introducing-claude

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Introducing the Anthropic Economic Advisory Council

· /news/introducing-the-anthropic-economic-advisory-council

Today, we’re announcing the formation of the Anthropic Economic Advisory Council, a group of distinguished economists who will provide Anthropic with expert guidance on the economi

Anthropic Economic Futures Program Launch

· /news/introducing-the-anthropic-economic-futures-program

Anthropic's new research initiative exploring AI's impact on the future of work and economy, developing policy frameworks for a changing workforce.

Investing in energy to secure America's AI future

· /news/investing-in-energy-to-secure-america-s-ai-future

Energy is central to winning the AI race and we need to ensure that America has the necessary infrastructure to maintain its lead. The importance of building this infrastructure go

Jay Kreps appointed to Anthropic's Board of Directors

· /news/jay-kreps-appointed-to-board-of-directors

Today, we're announcing that Jay Kreps, co-founder and CEO of Confluent, has joined Anthropic's Board of Directors. Jay's extensive experience in building and scaling highly succes

Claude for Enterprise Powers LLNL Research

· /news/lawrence-livermore-national-laboratory-expands-claude-for-enterprise-to-empower-scientists-and

Lawrence Livermore National Laboratory expands Claude for Enterprise access to 10,000 scientists, accelerating breakthroughs in energy and national security research.

Introducing the Model Context Protocol

· /news/model-context-protocol

The Model Context Protocol (MCP) is an open standard for connecting AI assistants to the systems where data lives, including content repositories, business tools, and development e

Expanding our model safety bug bounty program

· /news/model-safety-bug-bounty

The rapid progression of AI model capabilities demands an equally swift advancement in safety protocols. As we work on developing the next generation of our AI safeguarding systems

Understanding and Addressing AI Harms

· /news/our-approach-to-understanding-and-addressing-ai-harms

Learn about Anthropic's comprehensive framework for identifying, classifying, and mitigating potential harms from AI systems, ensuring responsible development of advanced AI techno

Measuring political bias in Claude

· /news/political-even-handedness

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Preparing for global elections in 2024

· /news/preparing-for-global-elections-in-2024

In this post, we’ll discuss some of the specific steps we’ve taken to help us detect and mitigate potential misuse of our AI tools in political contexts.

Collaborate with Claude on Projects

· /news/projects

Claude Pro and Team users can now organize chats into Projects. Projects bring together internal knowledge and chat activity in one place so Claude can be your go-to expert for gen

Prompt engineering for business performance

· /news/prompt-engineering-for-business-performance

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Protecting the wellbeing of our users

· /news/protecting-well-being-of-users

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Reflections on our Responsible Scaling Policy

· /news/reflections-on-our-responsible-scaling-policy

Last summer we published our first Responsible Scaling Policy (RSP), which focuses on addressing catastrophic safety failures and misuse of frontier models. In adopting this policy

Releasing Claude Instant 1.2

· /news/releasing-claude-instant-1-2

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic's AI Export Controls Framework Response

· /news/securing-america-s-compute-advantage-anthropic-s-position-on-the-diffusion-rule

Anthropic submits detailed recommendations for strengthening US export controls on advanced AI chips and model weights. We advocate for maintaining America's compute advantage, adj

SKT Partnership Announcement

· /news/skt-partnership-announcement

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Progress from our Frontier Red Team

· /news/strategic-warning-for-ai-risk-progress-and-insights-from-our-frontier-red-team

In this post, we are sharing what we have learned about the trajectory of potential national security risks from frontier AI models, along with some of our thoughts about challenge

Testing our safety defenses with a new bug bounty program

· /news/testing-our-safety-defenses-with-a-new-bug-bounty-program

Today, we're launching a new bug bounty program to stress-test our latest safety measures, in partnership with HackerOne. Similar to the program we announced last summer, we're cha

Introducing The Anthropic Institute

· /news/the-anthropic-institute

We’re launching The Anthropic Institute, a new effort to confront the most significant challenges that powerful AI will pose to our societies.

The case for targeted regulation

· /news/the-case-for-targeted-regulation

Increasingly powerful AI systems have the potential to accelerate scientific progress, unlock new medical treatments, and grow the economy. But along with the remarkable new capabi

The Long-Term Benefit Trust

· /news/the-long-term-benefit-trust

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

A framework for AI development transparency

· /news/the-need-for-transparency-in-frontier-ai

A targeted approach to increasing transparency in frontier AI development, focusing on safety standards and accountability measures for advanced AI systems.

Third-party testing as a key ingredient of AI policy

· /news/third-party-testing

We believe that the AI sector needs effective third-party testing for frontier AI systems. Developing a testing regime and associated policy interventions based on the insights of

Thoughts on America’s AI Action Plan

· /news/thoughts-on-america-s-ai-action-plan

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Updating our Usage Policy

· /news/updating-our-usage-policy

We're updating the policies that protect our users and ensure our products and services are used responsibly.

U.S. Elections Readiness

· /news/us-elections-readiness

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Usage Policy Update

· /news/usage-policy-update

Updates to our Usage Policy that reflect the growing capabilities and evolving usage of our products

Research (113)

Anthropic Education Report: The AI Fluency Index

· /research/AI-fluency-index

Anthropic's AI Fluency Index measures 11 observable behaviors across thousands of Claude.ai conversations to understand how people develop AI collaboration skills.

Anthropic Economic Index report: Economic primitives

· /research/anthropic-economic-index-january-2026-report

This report introduces new metrics of AI usage to provide a rich portrait of interactions with Claude in November 2025, just prior to the release of Opus 4.5.

Building Effective AI Agents

· /research/building-effective-agents

Discover how Anthropic approaches the development of reliable AI agents. Learn about our research on agent capabilities, safety considerations, and technical framework for building

Circuits Updates – April 2024

· /research/circuits-updates-april-2024

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Circuits Updates – August 2024

· /research/circuits-updates-august-2024

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Circuits Updates – July 2024

· /research/circuits-updates-july-2024

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Circuits Updates – June 2024

· /research/circuits-updates-june-2024

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Circuits Updates — May 2023

· /research/circuits-updates-may-2023

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Circuits Updates – September 2024

· /research/circuits-updates-sept-2024

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Claude’s Character

· /research/claude-character

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Confidential Inference via Trusted Virtual Machines

· /research/confidential-inference-trusted-vms

Announcing a new collaborative research paper on Confidential Inference, a set of tools to improve the security of our model weights and of our users' data

Insights on Crosscoder Model Diffing

· /research/crosscoder-model-diffing

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Challenges in evaluating AI systems

· /research/evaluating-ai-systems

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Exploring model welfare

· /research/exploring-model-welfare

Announcing a new research program at Anthropic on model welfare

In-context Learning and Induction Heads

· /research/in-context-learning-and-induction-heads

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Interpretability Dreams

· /research/interpretability-dreams

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Introducing our Science Blog

· /research/introducing-anthropic-science

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Language Models (Mostly) Know What They Know

· /research/language-models-mostly-know-what-they-know

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Many-shot jailbreaking

· /research/many-shot-jailbreaking

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Mapping the Mind of a Large Language Model

· /research/mapping-mind-language-model

We have identified how millions of concepts are represented inside Claude Sonnet, one of our deployed large language models. This is the first ever detailed look inside a modern, p

Measuring AI agent autonomy in practice

· /research/measuring-agent-autonomy

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Measuring the Persuasiveness of Language Models

· /research/measuring-model-persuasiveness

Anthropic developed a way to test how persuasive language models (LMs) are, and analyzed how persuasiveness scales across different versions of Claude.

Open-sourcing circuit-tracing tools

· /research/open-source-circuit-tracing

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Simple probes can catch sleeper agents

· /research/probes-catch-sleeper-agents

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Project Vend: Phase two

· /research/project-vend-2

How Claude turned around its failing vending machine business

Softmax Linear Units

· /research/softmax-linear-units

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

A statistical approach to model evaluations

· /research/statistical-approach-to-model-evals

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Claude SWE-Bench Performance

· /research/swe-bench-sonnet

Explore Claude's breakthrough performance on SWE-Bench, demonstrating advanced software engineering capabilities and code generation accuracy. Learn about our technical evaluation

Alignment Research

· /research/alignment

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Economic Research

· /research/economic-research

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Interpretability Research

· /research/interpretability

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Societal Impacts Research

· /research/societal-impacts

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Toy Models of Superposition

· /research/toy-models-of-superposition

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Reflections on Qualitative Research

· /research/transformer-circuits

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Trustworthy agents in practice

· /research/trustworthy-agents

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Vibe physics: The AI grad student

· /research/vibe-physics

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Engineering (24)

A postmortem of three recent issues

· /engineering/a-postmortem-of-three-recent-issues

This is a technical report on three bugs that intermittently degraded responses from Claude. Below we explain what happened, why it took time to fix, and what we're changing.

Building Effective AI Agents

· /engineering/building-effective-agents

Discover how Anthropic approaches the development of reliable AI agents. Learn about our research on agent capabilities, safety considerations, and technical framework for building

Contextual Retrieval in AI Systems

· /engineering/contextual-retrieval

Explore how Anthropic enhances AI systems through advanced contextual retrieval methods. Learn about our approach to improving information access and relevance in large language mo

Effective context engineering for AI agents

· /engineering/effective-context-engineering-for-ai-agents

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Effective harnesses for long-running agents

· /engineering/effective-harnesses-for-long-running-agents

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Equipping agents for the real world with Agent Skills

· /engineering/equipping-agents-for-the-real-world-with-agent-skills

Discover how Anthropic builds AI agents with practical capabilities through modular skills, enabling them to handle complex real-world tasks more effectively and reliably.

Claude SWE-Bench Performance

· /engineering/swe-bench-sonnet

Explore Claude's breakthrough performance on SWE-Bench, demonstrating advanced software engineering capabilities and code generation accuracy. Learn about our technical evaluation