
Introducing 100K Context Windows
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
从 Anthropic sitemap 抓取的 340 个带插图页面。图片分两类: cdn.sanity.io 上的定制插图;以及由 /api/opengraph-illustration?name=… 接口生成的符号插图。

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
A refreshed, more powerful Claude 3.5 Sonnet, Claude 3.5 Haiku, and a new experimental AI capability: computer use.

A robust, third-party evaluation ecosystem is essential for assessing AI capabilities and risks, but the current evaluations landscape is limited. Developing high-quality, safety-r
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic, AWS, and Accenture Team Up to Build Trusted Solutions for Enterprises
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
We have activated the AI Safety Level 3 (ASL-3) Deployment and Security Standards described in Anthropic’s Responsible Scaling Policy (RSP) in conjunction with launching Claude Opu
A first look at new education-specific integrations, expanded student programs, and university updates.
Claude for Financial Services now supports a native Excel plug-in, new connectors to real-time market, and pre-built skills for modeling, comp analysis, and earnings reports.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Today we are publishing a significant update to our Responsible Scaling Policy (RSP), the risk governance framework we use to mitigate potential catastrophic risks from frontier AI
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

We are excited to announce that Anthropic has achieved accredited certification under the new ISO/IEC 42001:2023 standard for our AI management system. ISO 42001 is the first inter
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic and Iceland announce national AI education pilot

The U.S. Department of Defense (DOD), through its Chief Digital and Artificial Intelligence Office (CDAO), has awarded Anthropic a two-year prototype other transaction agreement wi

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
The second update from the Anthropic Economic Index
Research on 74,000 educator conversations shows how faculty use Claude for teaching, research, and building interactive learning tools.
AI systems are no longer just specialized research tools: they’re everyday academic companions. As AIs integrate more deeply into educational environments, we need to consider impo
Chris Ciauri joins Anthropic as Managing Director of International, adding to our global leadership team as we expand our worldwide presence.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is endorsing SB 53, the California bill that governs powerful AI systems built by frontier AI developers like Anthropic.
NEC will deploy Claude to 30,000 employees and become Anthropic's first Japan-based global partner, co-developing AI products for finance, manufacturing, and government
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic partners with Menlo Ventures to launch Anthology Fund
Anthropic collaborates with the University of Chicago's Becker Friedman Institute to research AI's effects on labor markets, productivity, and economic distribution, enhancing our

We are proud to participate in the U.S. Department of Energy’s (DOE) first-ever 1,000 Scientist AI Jam, which will bring together scientists across multiple national laboratories t

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic has raised $3.5 billion at a $61.5 billion post-money valuation. The round was led by Lightspeed Venture Partners, with participation from Bessemer Venture Partners, Cisc
Anthropic has completed a Series F fundraising of $13 billion led by ICONIQ. This financing values Anthropic at $183 billion post-money. Along with ICONIQ, the round was co-led by
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic signs CMS health tech pledge
Anthropic signs White House pledge investing in AI education for America's youth, supporting AI students and educators nationwide
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Build AI in America: Anthropic Energy Report
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

In this post we detail insights from a sample of red teaming approaches that we’ve used to test our AI systems. Through this practice, we’ve begun to gather empirical data about th

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Alongside other leading AI companies, we’re committed to implementing robust child safety measures in the development, deployment, and maintenance of generative AI technologies.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Introducing Claude 3.5 Sonnet—our most intelligent model yet. Sonnet now outperforms competitor models and Claude 3 Opus on key evaluations, at twice the speed.
Today, we’re announcing Claude 3.7 Sonnet, our most intelligent model to date and the first hybrid reasoning model generally available on the market.

Today, we're announcing the Claude 3 model family, which sets new industry benchmarks across a wide range of cognitive tasks. The family includes three state-of-the-art models in a

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Discover Claude 4's breakthrough AI capabilities. Experience more reliable, interpretable assistance for complex tasks across work and learning.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Claude, Anthropic’s trusted AI assistant, is now available in Brazil. Starting today, consumers and businesses in Brazil will be able to access Claude.

Enterprise and Team customers can now upgrade to premium seats that include more usage and Claude Code—bringing our app and powerful coding agent together under one subscription.
Claude Code Security is one step towards our goal of more secure codebases and a higher security baseline across the industry.
Today, we’re launching Claude Design, a new Anthropic Labs product that lets you collaborate with Claude to create polished visual work like designs, prototypes, slides, one-pagers

We’re excited to announce that Claude, Anthropic’s trusted AI assistant, is now available for people and businesses across Europe to enhance their productivity and creativity.
Today, we're introducing a comprehensive solution for financial analysis that transforms how finance professionals analyze markets, conduct research, and make investment decisions
Discover how Claude accelerates life sciences research with new scientific connectors, skills, and improved performance for drug discovery and clinical work.
Anthropic launches Claude for Nonprofits to help organizations maximize their impact, featuring free AI training and discounted rates for nonprofits.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Claude Haiku 4.5, our latest small model, is available today to all users.
Claude models are approved for use in FedRAMP High and DoD Impact Level 4 and 5 workloads through Amazon Bedrock in AWS GovCloud (US) regions. Federal agencies and defense organiza
Claude Sonnet 4.5, Haiku 4.5, and Opus 4.1 models are now available in public preview in Microsoft Foundry, where Azure customers can build production applications and enterprise a

Connect your Claude account to Xcode 26 for AI-powered coding assistance. Debug, refactor, and build Apple apps faster with Claude Sonnet 4 by Anthropic.
We’ve made a choice: Claude will remain ad-free. We explain why advertising incentives are incompatible with a genuinely helpful AI assistant, and how we plan to expand access with
A new approach to a foundational document that expresses and shapes who Claude is
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

We’re upgrading our smartest model. Across agentic coding, computer use, tool use, search, and finance, Opus 4.6 is an industry-leading model, often by wide margin.

Our latest model, Claude Opus 4.7, is now generally available. Opus 4.7 is a notable improvement on Opus 4.6 in advanced software engineering, with particular gains on the most dif
We’re launching the Claude Partner Network, a program for partner organizations helping enterprises adopt Claude.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Claude Sonnet 4.5 is the best coding model in the world, strongest model for building complex agents, and best model at using computers.
Claude Sonnet 4.6 is a full upgrade of the model’s skills across coding, computer use, long-reasoning, agent planning, knowledge work, and design.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Explore how Anthropic enhances AI systems through advanced contextual retrieval methods. Learn about our approach to improving information access and relevance in large language mo
AI progress may lead to transformative AI systems in the next decade, but we do not yet understand how to make such systems safe and aligned with human values. In response, we are
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Deloitte will make Claude available to 470,000 people across its global network. Anthropic's largest enterprise AI deployment to date. Partner with Anthropic because Claude is buil
Detecting and Countering Malicious Uses of Claude
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic's threat intelligence report on AI cybercrime and other abuses

Developing a computer use model
Together with the NNSA and DOE national laboratories, we have co-developed a classifier—an AI system that automatically categorizes content—that distinguishes between concerning an
A report describing an a highly sophisticated AI-led cyberattack
Donating to a 501(c)(4) focused on AI issues in the public interest
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic's support for economic research comes to the UK and Europe
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Lessons and observations from generative AI in the first major election year since Claude has been available.
Introducing Claude Code upgrades: native VS Code extension, terminal UX updates, and checkpoints for autonomous development. Handle complex tasks with confidence.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic's mission is to build reliable, interpretable, steerable AI systems. We have been excited to see our technology used in areas like coding, customer service, drug discover
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Announcing a dramatic increase in Anthropic's compute resources
Claude is now available for purchase through the General Services Administration (GSA) schedule, making it easier for all U.S. federal government departments and agencies to quickl


Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Starting today, the new Claude 3.5 Sonnet begins rolling out on GitHub Copilot, enabling developers to choose Claude 3.5 Sonnet for coding—directly in Visual Studio Code and GitHub

When we turn up the strength of the “Golden Gate Bridge” feature, Claude’s responses begin to focus on the Golden Gate Bridge. For a short time, we’re making this model available f
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Claude 3 Haiku and Claude 3 Sonnet are now generally available on Google Cloud’s Vertex AI platform.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

An announcement of Anthropic's plans to expand across Europe

An announcement of Anthropic's plans to expand into Japan
Introducing Claude for Healthcare with HIPAA-ready infrastructure, plus expanded Life Sciences tools for clinical trials and regulatory submissions. New connectors to CMS, Medidata
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Claude for Education

Claude is now available in Canada. Starting today, people and businesses across the country will be able to access Claude.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Join us on May 22, 2025 in San Francisco for Code with Claude, a hands-on developer conference featuring workshops, labs, and insights on building with Claude API, CLI tools, and M
Today, we’re announcing the formation of the Anthropic Economic Advisory Council, a group of distinguished economists who will provide Anthropic with expert guidance on the economi
Anthropic's new research initiative exploring AI's impact on the future of work and economy, developing policy frameworks for a changing workforce.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Energy is central to winning the AI race and we need to ensure that America has the necessary infrastructure to maintain its lead. The importance of building this infrastructure go

Today, we're announcing that Jay Kreps, co-founder and CEO of Confluent, has joined Anthropic's Board of Directors. Jay's extensive experience in building and scaling highly succes

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Lawrence Livermore National Laboratory expands Claude for Enterprise access to 10,000 scientists, accelerating breakthroughs in energy, and national security research.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

We're excited to announce that Mike Krieger has joined Anthropic as our Chief Product Officer.

The Model Context Protocol (MCP) is an open standard for connecting AI assistants to the systems where data lives, including content repositories, business tools, and development e

The rapid progression of AI model capabilities demands an equally swift advancement in safety protocols. As we work on developing the next generation of our AI safeguarding systems
Announcing a Memorandum of Understanding between Anthropic and the UK Government
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
We are removing barriers to government AI adoption by offering Claude for Enterprise and Claude for Government to all three branches of government, including federal civilian execu
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Learn about Anthropic's comprehensive framework for identifying, classifying, and mitigating potential harms from AI systems, ensuring responsible development of advanced AI techno
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

A call for greater focus and urgency

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic will appoint Paul Smith as its first Chief Commercial Officer, who will assume the role later this year.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

In this post, we’ll discuss some of the specific steps we’ve taken to help us detect and mitigate potential misuse of our AI tools in political contexts.

Claude Pro and Team users can now organize chats into Projects. Projects bring together internal knowledge and chat activity in one place so Claude can be your go-to expert for gen

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Last summer we published our first Responsible Scaling Policy (RSP), which focuses on addressing catastrophic safety failures and misuse of frontier models. In adopting this policy

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
An update to Anthropic's policy to mitigate catastrophic risks from AI
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Salesforce enhances its Einstein 1 Studio with Anthropic's Claude AI models, now available through Amazon Bedrock. Learn how this integration empowers enterprises to improve effici
Anthropic submits detailed recommendations for strengthening US export controls on advanced AI chips and model weights. We advocate for maintaining America's compute advantage, adj
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic's response to the Secretary of War and advice for customers
A statement from Anthropic CEO, Dario Amodei, on Anthropic’s commitment to advancing America's leadership in building powerful and beneficial AI

A statement from our CEO on national security uses of AI

In this post, we are sharing what we have learned about the trajectory of potential national security risks from frontier AI models, along with some of our thoughts about challenge
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

This blog provides a snapshot of the work we've done since last summer to test our models for elections-related risks.
Today, we're launching a new bug bounty program to stress-test our latest safety measures, in partnership with HackerOne. Similar to the program we announced last summer, we're cha
Announcement of the new Anthropic Economic Index and description of the new data on AI use in occupations
We’re launching The Anthropic Institute, a new effort to confront the most significant challenges that powerful AI will pose to our societies.

Increasingly powerful AI systems have the potential to accelerate scientific progress, unlock new medical treatments, and grow the economy. But along with the remarkable new capabi

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
A targeted approach to increasing transparency in frontier AI development, focusing on safety standards and accountability measures for advanced AI systems.

We believe that the AI sector needs effective third-party testing for frontier AI systems. Developing a testing regime and associated policy interventions based on the insights of
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

We're updating the policies that protect our users and ensure our products and services are used responsibly.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Updates to our Usage Policy that reflect the growing capabilities and evolving usage of our products

A statement from Dario Amodei

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

New research on simulated blackmail, industrial espionage, and other misaligned behaviors in LLMs
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic's AI Fluency Index measures 11 observable behaviors across thousands of Claude.ai conversations to understand how people develop AI collaboration skills.

A paper from Anthropic's Alignment Science team on Alignment Faking in AI large language models
This report introduces new metrics of AI usage to provide a rich portrait of interactions with Claude in November 2025, just prior to the release of Opus 4.5.
To study such patterns of early AI adoption, we extend the Anthropic Economic Index along two important dimensions, introducing a geographic analysis of Claude.ai conversations and
What 1,250 professionals told us about working with AI

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

A collaboration between Anthropic's Alignment Science and Interpretability teams
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
How we've improved Claude's cyber defense skills

Discover how Anthropic approaches the development of reliable AI agents. Learn about our research on agent capabilities, safety considerations, and technical framework for building

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

A blog post describing Anthropic’s new system, Clio, for analyzing how people use AI while maintaining their privacy

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Announcing a new collaborative research paper on Confidential Inference, a set of tools to improve the security of our model weights and of our users' data

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

A paper from Anthropic describing a new way to guard LLMs against jailbreaking

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

New research from Anthropic exploring geographic patterns of AI use
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
We’ve asked economists and researchers to explore policy responses to the potential economic effects of powerful AI. We share some of the initial ideas and feedback we’ve received.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

All modern language models sometimes act like they have emotions. What’s behind these behaviors? Our interpretability team investigates.

An update on our exploratory research on model welfare

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic economic research on productivity gains

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

A new piece of Anthropic research by Durmus et al.: "Evaluating feature steering: A case study in mitigating social biases"

Announcing a new research program at Anthropic on model welfare

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic research on predicting rare, undesirable AI behaviors
How AI Is Transforming Work at Anthropic
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Data on how software developers are using Claude

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Research from Anthropic on the ability of large language models to introspect
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

We have identified how millions of concepts are represented inside Claude Sonnet, one of our deployed large language models. This is the first ever detailed look inside a modern, p
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic developed a way to test how persuasive language models (LMs) are, and analyzed how persuasiveness scales across different versions of Claude.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
A theory of why AI models act like humans
A paper from Anthropic describing persona vectors and their applications to monitoring and controlling model behavior

A new automated auditing tool for AI safety research

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

A practical experiment on AI's ability to affect the physical world

We let Claude run a small shop in the Anthropic office. Here's what happened.

How Claude turned around its failing vending machine business
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Research from Anthropic on the faithfulness of AI models' Chain-of-Thought

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Empirical evidence that serious misalignment can emerge from seemingly benign reward misspecification.

A new paper on AI safety evaluations from Anthropic's Alignment Science team

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

A new set of evaluations to test the sabotage and monitoring capabilities of LLM AI models

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic research on data-poisoning attacks in large language models

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Explore Claude's breakthrough performance on SWE-Bench, demonstrating advanced software engineering capabilities and code generation accuracy. Learn about our technical evaluation

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic's latest interpretability research: a new microscope to understand Claude's internal mechanisms

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

An Anthropic research paper testing which values AI models express in the real world
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Discussing Claude's new thought process

This is a technical report on three bugs that intermittently degraded responses from Claude. Below we explain what happened, why it took time to fix, and what we're changing.

Claude can now discover, learn, and execute tools dynamically to enable agents that take action in the real world. Here’s how.

What we learned from three iterations of a performance engineering take-home that Claude keeps beating.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Discover how Anthropic approaches the development of reliable AI agents. Learn about our research on agent capabilities, safety considerations, and technical framework for building

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Tips and patterns for getting the most out of Claude Code, from configuring your environment to scaling across parallel sessions.

Learn how Claude Code's new sandboxing feature protects developers with filesystem and network isolation, reducing permission prompts and increasing user safety.

A blog post for developers, describing a new method for complex tool-use situations

Learn how code execution with the Model Context Protocol enables agents to handle more tools while using fewer tokens, reducing context overhead by up to 98.7%.

Explore how Anthropic enhances AI systems through advanced contextual retrieval methods. Learn about our approach to improving information access and relevance in large language mo

Demystifying evals for AI agents

Claude Desktop Extensions: One-click MCP server installation for Claude Desktop

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Discover how Anthropic builds AI agents with practical capabilities through modular skills, enabling them to handle complex real-world tasks more effectively and reliably.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

On the the engineering challenges and lessons learned from building Claude's Research system

Explore Claude's breakthrough performance on SWE-Bench, demonstrating advanced software engineering capabilities and code generation accuracy. Learn about our technical evaluation

Writing effective tools for AI agents—using AI agents