Page Gallery

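340 illustrated pages crawled from the Anthropic sitemap. The images fall into two types: custom illustrations hosted on cdn.sanity.io, and symbolic illustrations generated by the /api/opengraph-illustration?name=… endpoint.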
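
The two image types can be told apart from the image URL alone. The following is a minimal sketch, assuming that a host of cdn.sanity.io marks a custom illustration and a path of /api/opengraph-illustration marks a generated symbolic one. The function name, the returned labels, and the sample URLs are hypothetical and are not taken from the gallery's own code.

```python
from urllib.parse import urlparse

# Values taken from the gallery description above.
SANITY_HOST = "cdn.sanity.io"
OG_PATH = "/api/opengraph-illustration"


def classify_illustration(url: str) -> str:
    """Return "custom", "symbolic", or "unknown" for an image URL.

    The labels are illustrative; the gallery itself may name them differently.
    """
    parts = urlparse(url)
    # Custom illustrations are served from the Sanity CDN host.
    if parts.hostname == SANITY_HOST:
        return "custom"
    # Symbolic illustrations come from the OpenGraph endpoint, which may be
    # given as an absolute or a site-relative URL.
    if parts.path.rstrip("/") == OG_PATH:
        return "symbolic"
    return "unknown"


if __name__ == "__main__":
    # Hypothetical sample URLs, for illustration only.
    samples = [
        "https://cdn.sanity.io/images/placeholder.png",
        "/api/opengraph-illustration?name=placeholder",
        "https://example.com/other.png",
    ]
    for sample in samples:
        print(sample, "->", classify_illustration(sample))
```

Matching on host and path rather than on a substring avoids misclassifying a URL that merely mentions one of the two strings in its query string.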

340 in total

News & Announcements (203)

Introducing 100K Context Windows

— · /news/100k-context-windows

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Introducing computer use, a new Claude 3.5 Sonnet, and Claude 3.5 Haiku

— · /news/3-5-models-and-computer-use

A refreshed, more powerful Claude 3.5 Sonnet, Claude 3.5 Haiku, and a new experimental AI capability: computer use.

A new initiative for developing third-party model evaluations

— · /news/a-new-initiative-for-developing-third-party-model-evaluations

A robust, third-party evaluation ecosystem is essential for assessing AI capabilities and risks, but the current evaluations landscape is limited. Developing high-quality, safety-r

How scientists are using Claude to accelerate research and discovery

— · /news/accelerating-scientific-research

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Accenture, AWS, Anthropic Collaboration

— · /news/accenture-aws-anthropic

Anthropic, AWS, and Accenture Team Up to Build Trusted Solutions for Enterprises

Anthropic acquires Vercept to advance Claude's computer use capabilities

— · /news/acquires-vercept

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Activating AI Safety Level 3 protections

— · /news/activating-asl3-protections

We have activated the AI Safety Level 3 (ASL-3) Deployment and Security Standards described in Anthropic’s Responsible Scaling Policy (RSP) in conjunction with launching Claude Opu

Advancing Claude for Education

— · /news/advancing-claude-for-education

A first look at new education-specific integrations, expanded student programs, and university updates.

Advancing Claude for Financial Services

— · /news/advancing-claude-for-financial-services

Claude for Financial Services now supports a native Excel plug-in, new connectors to real-time market, and pre-built skills for modeling, comp analysis, and earnings reports.

Introducing Anthropic's AI for Science Program

— · /news/ai-for-science-program

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

An AI Policy Tool for Today: Ambitiously Invest in NIST

— · /news/an-ai-policy-tool-for-today-ambitiously-invest-in-nist

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Announcing our updated Responsible Scaling Policy

— · /news/announcing-our-updated-responsible-scaling-policy

Today we are publishing a significant update to our Responsible Scaling Policy (RSP), the risk governance framework we use to mitigate potential catastrophic risks from frontier AI

Accenture and Anthropic launch multi-year partnership to move enterprises from AI pilots to production

— · /news/anthropic-accenture-partnership

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic achieves ISO 42001 certification for responsible AI

— · /news/anthropic-achieves-iso-42001-certification-for-responsible-ai

We are excited to announce that Anthropic has achieved accredited certification under the new ISO/IEC 42001:2023 standard for our AI management system. ISO 42001 is the first inter

Anthropic acquires Bun as Claude Code reaches $1B milestone

— · /news/anthropic-acquires-bun-as-claude-code-reaches-usd1b-milestone

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic and Amazon expand collaboration for up to 5 gigawatts of new compute

— · /news/anthropic-amazon-compute

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Powering the next generation of AI development with AWS

— · /news/anthropic-amazon-trainium

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Expanding access to safer AI with Amazon

— · /news/anthropic-amazon

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic and Iceland announce one of the world’s first national AI education pilots

— · /news/anthropic-and-iceland-announce-one-of-the-world-s-first-national-ai-education-pilots

Anthropic and Iceland announce national AI education pilot

Anthropic awarded $200M DOD agreement for AI capabilities

— · /news/anthropic-and-the-department-of-defense-to-advance-responsible-ai-in-defense-operations

The U.S. Department of Defense (DOD), through its Chief Digital and Artificial Intelligence Office (CDAO), has awarded Anthropic a two-year prototype other transaction agreement wi

Anthropic appoints Irina Ghose as Managing Director of India ahead of Bengaluru office opening

— · /news/anthropic-appoints-irina-ghose-as-managing-director-of-india

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic partners with BCG

— · /news/anthropic-bcg

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic partners with CodePath to bring Claude to the US’s largest collegiate computer science program

— · /news/anthropic-codepath-partnership

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic Economic Index: Insights from Claude 3.7 Sonnet

— · /news/anthropic-economic-index-insights-from-claude-sonnet-3-7

The second update from the Anthropic Economic Index

Anthropic education report: How educators use Claude

— · /news/anthropic-education-report-how-educators-use-claude

Research on 74,000 educator conversations shows how faculty use Claude for teaching, research, and building interactive learning tools.

Anthropic Education Report: How University Students Use Claude

— · /news/anthropic-education-report-how-university-students-use-claude

AI systems are no longer just specialized research tools: they’re everyday academic companions. As AIs integrate more deeply into educational environments, we need to consider impo

Anthropic expands global leadership in enterprise AI, naming Chris Ciauri as Managing Director of International

— · /news/anthropic-expands-global-leadership-in-enterprise-ai-naming-chris-ciauri-as-managing-director-of

Chris Ciauri joins Anthropic as Managing Director of International, adding to our global leadership team as we expand our worldwide presence.

Anthropic launches higher education advisory board and AI Fluency courses

— · /news/anthropic-higher-education-initiatives

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic and Infosys collaborate to build AI agents for telecommunications and other regulated industries

— · /news/anthropic-infosys

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic invests $50 billion in American AI infrastructure

— · /news/anthropic-invests-50-billion-in-american-ai-infrastructure

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic is endorsing SB 53

— · /news/anthropic-is-endorsing-sb-53

Anthropic is endorsing SB 53, the California bill that governs powerful AI systems built by frontier AI developers like Anthropic.

Anthropic and NEC partner to build AI-native engineering at scale in Japan

— · /news/anthropic-nec

NEC will deploy Claude to 30,000 employees and become Anthropic's first Japan-based global partner, co-developing AI products for finance, manufacturing, and government

Anthropic partners with Allen Institute and Howard Hughes Medical Institute to accelerate scientific discovery

— · /news/anthropic-partners-with-allen-institute-and-howard-hughes-medical-institute

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic Partners with Google Cloud

— · /news/anthropic-partners-with-google-cloud

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic partners with Menlo Ventures to launch Anthology Fund

— · /news/anthropic-partners-with-menlo-ventures-to-launch-anthology-fund

Anthropic partners with Menlo Ventures to launch Anthology Fund

Anthropic partners with the University of Chicago’s Becker Friedman Institute for Economics on AI economic research

— · /news/anthropic-partners-with-the-university-of-chicago-s-becker-friedman-institute-on-ai-economic

Anthropic collaborates with the University of Chicago's Becker Friedman Institute to research AI's effects on labor markets, productivity, and economic distribution, enhancing our

Anthropic partners with U.S. National Labs for first 1,000 Scientist AI Jam

— · /news/anthropic-partners-with-u-s-national-labs-for-first-1-000-scientist-ai-jam

We are proud to participate in the U.S. Department of Energy’s (DOE) first-ever 1,000 Scientist AI Jam, which will bring together scientists across multiple national laboratories t

Anthropic raises $124 million to build more reliable, general AI systems

— · /news/anthropic-raises-124-million-to-build-more-reliable-general-ai-systems

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic raises $30 billion in Series G funding at $380 billion post-money valuation

— · /news/anthropic-raises-30-billion-series-g-funding-380-billion-post-money-valuation

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic Raises Series B to build steerable, interpretable, robust AI systems

— · /news/anthropic-raises-series-b-to-build-safe-reliable-ai

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic raises Series E at $61.5B post-money valuation

— · /news/anthropic-raises-series-e-at-usd61-5b-post-money-valuation

Anthropic has raised $3.5 billion at a $61.5 billion post-money valuation. The round was led by Lightspeed Venture Partners, with participation from Bessemer Venture Partners, Cisc

Anthropic raises $13B Series F at $183B post-money valuation

— · /news/anthropic-raises-series-f-at-usd183b-post-money-valuation

Anthropic has completed a Series F fundraising of $13 billion led by ICONIQ. This financing values Anthropic at $183 billion post-money. Along with ICONIQ, the round was co-led by

Anthropic and the Government of Rwanda sign MOU for AI in health and education

— · /news/anthropic-rwanda-mou

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic’s Recommendations to OSTP for the U.S. AI Action Plan

— · /news/anthropic-s-recommendations-ostp-u-s-ai-action-plan

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic’s response to Governor Newsom’s AI working group draft report

— · /news/anthropic-s-response-to-governor-newsom-s-ai-working-group-draft-report

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic Raises $450 Million in Series C Funding to Scale Reliable AI Products

— · /news/anthropic-series-c

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic signs CMS health tech pledge

— · /news/anthropic-signs-cms-health-tech-ecosystem-pledge-to-advance-healthcare-interoperability

Anthropic signs CMS health tech pledge

Anthropic joins White House pledge for AI education

— · /news/anthropic-signs-pledge-to-americas-youth-investing-in-ai-education

Anthropic signs White House pledge investing in AI education for America's youth, supporting AI students and educators nationwide

Anthropic and Teach For All launch global AI training initiative for educators

— · /news/anthropic-teach-for-all

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic's Responsible Scaling Policy

— · /news/anthropics-responsible-scaling-policy

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Apple’s Xcode now supports the Claude Agent SDK

— · /news/apple-xcode-claude-agent-sdk

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Australian government and Anthropic sign MOU for AI safety and research

— · /news/australia-MOU

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic opens Bengaluru office and announces new partnerships across India

— · /news/bengaluru-office-partnerships-across-india

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Build AI in America: Anthropic Energy Report

— · /news/build-ai-in-america

Build AI in America: Anthropic Energy Report

Building safeguards for Claude

— · /news/building-safeguards-for-claude

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Challenges in Red Teaming AI Systems

— · /news/challenges-in-red-teaming-ai-systems

In this post we detail insights from a sample of red teaming approaches that we’ve used to test our AI systems. Through this practice, we’ve begun to gather empirical data about th

Charting a Path to AI Accountability

— · /news/charting-a-path-to-ai-accountability

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Aligning on child safety principles

— · /news/child-safety-principles

Alongside other leading AI companies, we’re committed to implementing robust child safety measures in the development, deployment, and maintenance of generative AI technologies.

Chris Liddell appointed to Anthropic’s board of directors

— · /news/chris-liddell-appointed-anthropic-board

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Introducing Claude 2.1

— · /news/claude-2-1

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Claude 2

— · /news/claude-2

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Introducing Claude 3.5 Sonnet

— · /news/claude-3-5-sonnet

Introducing Claude 3.5 Sonnet—our most intelligent model yet. Sonnet now outperforms competitor models and Claude 3 Opus on key evaluations, at twice the speed.

Claude 3.7 Sonnet and Claude Code

— · /news/claude-3-7-sonnet

Today, we’re announcing Claude 3.7 Sonnet, our most intelligent model to date and the first hybrid reasoning model generally available on the market.

Introducing the next generation of Claude

— · /news/claude-3-family

Today, we're announcing the Claude 3 model family, which sets new industry benchmarks across a wide range of cognitive tasks. The family includes three state-of-the-art models in a

Claude 3 Haiku: our fastest model yet

— · /news/claude-3-haiku

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Introducing Claude 4

— · /news/claude-4

Discover Claude 4's breakthrough AI capabilities. Experience more reliable, interpretable assistance for complex tasks across work and learning.

Claude and Alexa+

— · /news/claude-and-alexa-plus

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Claude is now available in Brazil

— · /news/claude-brazil

Claude, Anthropic’s trusted AI assistant, is now available in Brazil. Starting today, consumers and businesses in Brazil will be able to access Claude.

Claude Code and new admin controls for business plans

— · /news/claude-code-on-team-and-enterprise

Enterprise and Team customers can now upgrade to premium seats that include more usage and Claude Code—bringing our app and powerful coding agent together under one subscription.

Making frontier cybersecurity capabilities available to defenders

— · /news/claude-code-security

Claude Code Security is one step towards our goal of more secure codebases and a higher security baseline across the industry.

Introducing Claude Design by Anthropic Labs

— · /news/claude-design-anthropic-labs

Today, we’re launching Claude Design, a new Anthropic Labs product that lets you collaborate with Claude to create polished visual work like designs, prototypes, slides, one-pagers

Claude is now available in the EU

— · /news/claude-europe

We’re excited to announce that Claude, Anthropic’s trusted AI assistant, is now available for people and businesses across Europe to enhance their productivity and creativity.

Claude for Financial Services

— · /news/claude-for-financial-services

Today, we're introducing a comprehensive solution for financial analysis that transforms how finance professionals analyze markets, conduct research, and make investment decisions

Claude for Life Sciences

— · /news/claude-for-life-sciences

Discover how Claude accelerates life sciences research with new scientific connectors, skills, and improved performance for drug discovery and clinical work.

Introducing Claude for Nonprofits

— · /news/claude-for-nonprofits

Anthropic launches Claude for Nonprofits to help organizations maximize their impact, featuring free AI training and discounted rates for nonprofits.

Claude Gov models for U.S. national security customers

— · /news/claude-gov-models-for-u-s-national-security-customers

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Introducing Claude Haiku 4.5

— · /news/claude-haiku-4-5

Claude Haiku 4.5, our latest small model, is available today to all users.

Claude in Amazon Bedrock: Approved for Use in FedRAMP High and DoD IL4/5 Workloads

— · /news/claude-in-amazon-bedrock-fedramp-high

Claude models are approved for use in FedRAMP High and DoD Impact Level 4 and 5 workloads through Amazon Bedrock in AWS GovCloud (US) regions. Federal agencies and defense organiza

Claude now available in Microsoft Foundry and Microsoft 365 Copilot

— · /news/claude-in-microsoft-foundry

Claude Sonnet 4.5, Haiku 4.5, and Opus 4.1 models are now available in public preview in Microsoft Foundry, where Azure customers can build production applications and enterprise a

Claude is now generally available in Xcode

— · /news/claude-in-xcode

Connect your Claude account to Xcode 26 for AI-powered coding assistance. Debug, refactor, and build Apple apps faster with Claude Sonnet 4 by Anthropic.

Claude is a space to think

— · /news/claude-is-a-space-to-think

We’ve made a choice: Claude will remain ad-free. We explain why advertising incentives are incompatible with a genuinely helpful AI assistant, and how we plan to expand access with

Claude's new constitution

— · /news/claude-new-constitution

A new approach to a foundational document that expresses and shapes who Claude is

Claude Opus 4.1

— · /news/claude-opus-4-1

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Introducing Claude Opus 4.5

— · /news/claude-opus-4-5

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Claude Opus 4.6

— · /news/claude-opus-4-6

We’re upgrading our smartest model. Across agentic coding, computer use, tool use, search, and finance, Opus 4.6 is an industry-leading model, often by a wide margin.

Introducing Claude Opus 4.7

— · /news/claude-opus-4-7

Our latest model, Claude Opus 4.7, is now generally available. Opus 4.7 is a notable improvement on Opus 4.6 in advanced software engineering, with particular gains on the most dif

Anthropic invests $100 million into the Claude Partner Network

— · /news/claude-partner-network

We’re launching the Claude Partner Network, a program for partner organizations helping enterprises adopt Claude.

Introducing Claude Pro

— · /news/claude-pro

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Introducing Claude Sonnet 4.5

— · /news/claude-sonnet-4-5

Claude Sonnet 4.5 is the best coding model in the world, strongest model for building complex agents, and best model at using computers.

Introducing Sonnet 4.6

— · /news/claude-sonnet-4-6

Claude Sonnet 4.6 is a full upgrade of the model’s skills across coding, computer use, long-reasoning, agent planning, knowledge work, and design.

Claude’s Constitution

— · /news/claudes-constitution

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Cognizant will make Claude available to 350,000 employees, accelerating enterprise AI adoption and internal transformation

— · /news/cognizant-partnership

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Sharing our compliance framework for California's Transparency in Frontier AI Act

— · /news/compliance-framework-SB53

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Contextual Retrieval in AI Systems

— · /news/contextual-retrieval

Explore how Anthropic enhances AI systems through advanced contextual retrieval methods. Learn about our approach to improving information access and relevance in large language mo

Core Views on AI Safety: When, Why, What, and How

— · /news/core-views-on-ai-safety

AI progress may lead to transformative AI systems in the next decade, but we do not yet understand how to make such systems safe and aligned with human values. In response, we are

Covering electricity price increases from our data centers

— · /news/covering-electricity-price-increases

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic Deloitte Partnership

— · /news/deloitte-anthropic-partnership

Deloitte will make Claude available to 470,000 people across its global network. Anthropic's largest enterprise AI deployment to date. Partner with Anthropic because Claude is buil

Detecting and Countering Malicious Uses of Claude

— · /news/detecting-and-countering-malicious-uses-of-claude-march-2025

Detecting and Countering Malicious Uses of Claude

Detecting and preventing distillation attacks

— · /news/detecting-and-preventing-distillation-attacks

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Detecting and countering misuse of AI: August 2025

— · /news/detecting-countering-misuse-aug-2025

Anthropic's threat intelligence report on AI cybercrime and other abuses

Developing a computer use model

— · /news/developing-computer-use

Developing a computer use model

Developing nuclear safeguards for AI through public-private partnership

— · /news/developing-nuclear-safeguards-for-ai-through-public-private-partnership

Together with the NNSA and DOE national laboratories, we have co-developed a classifier—an AI system that automatically categorizes content—that distinguishes between concerning an

Disrupting the first reported AI-orchestrated cyber espionage campaign

— · /news/disrupting-AI-espionage

A report describing a highly sophisticated AI-led cyberattack

Anthropic is donating $20 million to Public First Action

— · /news/donate-public-first-action

Donating to a 501(c)(4) focused on AI issues in the public interest

Donating the Model Context Protocol and establishing the Agentic AI Foundation

— · /news/donating-the-model-context-protocol-and-establishing-of-the-agentic-ai-foundation

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Launching the Anthropic Economic Futures Programme in the UK and Europe

— · /news/economic-futures-uk-europe

Anthropic's support for economic research comes to the UK and Europe

An update on our election safeguards

— · /news/election-safeguards-update

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Elections and AI in 2024: Anthropic observations and learnings

— · /news/elections-ai-2024

Lessons and observations from generative AI in the first major election year since Claude has been available.

Enabling Claude Code to work more autonomously

— · /news/enabling-claude-code-to-work-more-autonomously

Introducing Claude Code upgrades: native VS Code extension, terminal UX updates, and checkpoints for autonomous development. Handle complex tasks with confidence.

Anthropic to sign the EU Code of Practice

— · /news/eu-code-practice

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Expanded legal protections and improvements to our API

— · /news/expanded-legal-protections-api-improvements

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Expanding Access to Claude for Government

— · /news/expanding-access-to-claude-for-government

Anthropic's mission is to build reliable, interpretable, steerable AI systems. We have been excited to see our technology used in areas like coding, customer service, drug discover

Anthropic expands global operations to India, plans to open an office in Bengaluru.

— · /news/expanding-global-operations-to-india

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Expanding our use of Google Cloud TPUs and Services

— · /news/expanding-our-use-of-google-cloud-tpus-and-services

Announcing a dramatic increase in Anthropic's compute resources

U.S. federal departments and agencies can now more quickly and easily get access to Claude

— · /news/federal-government-departments-and-agencies-can-now-purchase-claude-through-the-gsa-schedule

Claude is now available for purchase through the General Services Administration (GSA) schedule, making it easier for all U.S. federal government departments and agencies to quickl

(untitled)

— · /news/fine-tune-claude-3-haiku

Frontier Model Security

— · /news/frontier-model-security

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Frontier Threats Red Teaming for AI Safety

— · /news/frontier-threats-red-teaming-for-ai-safety

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Working with the US Department of Energy to unlock the next era of scientific discovery

— · /news/genesis-mission-partnership

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Claude 3.5 Sonnet on GitHub Copilot

— · /news/github-copilot

Starting today, the new Claude 3.5 Sonnet begins rolling out on GitHub Copilot, enabling developers to choose Claude 3.5 Sonnet for coding—directly in Visual Studio Code and GitHub

Golden Gate Claude

— · /news/golden-gate-claude

When we turn up the strength of the “Golden Gate Bridge” feature, Claude’s responses begin to focus on the Golden Gate Bridge. For a short time, we’re making this model available f

Anthropic expands partnership with Google and Broadcom for multiple gigawatts of next-generation compute

— · /news/google-broadcom-partnership-compute

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Claude 3 models on Vertex AI

— · /news/google-vertex-general-availability

Claude 3 Haiku and Claude 3 Sonnet are now generally available on Google Cloud’s Vertex AI platform.

Anthropic partners with the UK Government to bring AI assistance to GOV.UK services

— · /news/gov-UK-partnership

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic Appoints Guillaume Princen as Head of EMEA and Announces 100+ New Roles Across the Region

— · /news/head-of-EMEA-new-roles

An announcement of Anthropic's plans to expand across Europe

Anthropic appoints Hidetoshi Tojo as Head of Japan and announces hiring plans

— · /news/head-of-japan-hiring-plans

An announcement of Anthropic's plans to expand into Japan

Advancing Claude in healthcare and the life sciences

— · /news/healthcare-life-sciences

Introducing Claude for Healthcare with HIPAA-ready infrastructure, plus expanded Life Sciences tools for clinical trials and regulatory submissions. New connectors to CMS, Medidata

How people use Claude for support, advice, and companionship

— · /news/how-people-use-claude-for-support-advice-and-companionship

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Introducing Labs

— · /news/introducing-anthropic-labs

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Introducing Anthropic's Transparency Hub

— · /news/introducing-anthropic-transparency-hub

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Introducing Claude for education

— · /news/introducing-claude-for-education

Claude for Education

Introducing Claude to Canada

— · /news/introducing-claude-to-canada

Claude is now available in Canada. Starting today, people and businesses across the country will be able to access Claude.

Introducing Claude

— · /news/introducing-claude

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Code with Claude - Anthropic's First Developer Conference

— · /news/Introducing-code-with-claude

Join us on May 22, 2025 in San Francisco for Code with Claude, a hands-on developer conference featuring workshops, labs, and insights on building with Claude API, CLI tools, and M

Introducing the Anthropic Economic Advisory Council

— · /news/introducing-the-anthropic-economic-advisory-council

Today, we’re announcing the formation of the Anthropic Economic Advisory Council, a group of distinguished economists who will provide Anthropic with expert guidance on the economi

Anthropic Economic Futures Program Launch

— · /news/introducing-the-anthropic-economic-futures-program

Anthropic's new research initiative exploring AI's impact on the future of work and economy, developing policy frameworks for a changing workforce.

Introducing the Anthropic National Security and Public Sector Advisory Council

— · /news/introducing-the-anthropic-national-security-and-public-sector-advisory-council

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Investing in energy to secure America's AI future

— · /news/investing-in-energy-to-secure-america-s-ai-future

Energy is central to winning the AI race and we need to ensure that America has the necessary infrastructure to maintain its lead. The importance of building this infrastructure go

Jay Kreps appointed to Anthropic's Board of Directors

— · /news/jay-kreps-appointed-to-board-of-directors

Today, we're announcing that Jay Kreps, co-founder and CEO of Confluent, has joined Anthropic's Board of Directors. Jay's extensive experience in building and scaling highly succes

Krishna Rao joins Anthropic as Chief Financial Officer

— · /news/krishna-rao-joins-anthropic

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Claude for Enterprise Powers LLNL Research

— · /news/lawrence-livermore-national-laboratory-expands-claude-for-enterprise-to-empower-scientists-and

Lawrence Livermore National Laboratory expands Claude for Enterprise access to 10,000 scientists, accelerating breakthroughs in energy and national security research.

Lyft to bring Claude to more than 40 million riders and over 1 million drivers

— · /news/lyft-announcement

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Mariano-Florentino Cuéllar appointed to Anthropic’s Long-Term Benefit Trust

— · /news/mariano-florentino-long-term-benefit-trust

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

The State of Maryland partners with Anthropic to better serve residents

— · /news/maryland-partnership

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Microsoft, NVIDIA and Anthropic announced new strategic partnerships.

— · /news/microsoft-nvidia-anthropic-announce-strategic-partnerships

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Mike Krieger joins Anthropic as Chief Product Officer

— · /news/mike-krieger-joins-anthropic

We're excited to announce that Mike Krieger has joined Anthropic as our Chief Product Officer.

Introducing the Model Context Protocol

— · /news/model-context-protocol

The Model Context Protocol (MCP) is an open standard for connecting AI assistants to the systems where data lives, including content repositories, business tools, and development e

Expanding our model safety bug bounty program

— · /news/model-safety-bug-bounty

The rapid progression of AI model capabilities demands an equally swift advancement in safety protocols. As we work on developing the next generation of our AI safeguarding systems

Anthropic signs MOU with UK Government to explore how AI can transform UK public services

— · /news/mou-uk-government

Announcing a Memorandum of Understanding between Anthropic and the UK Government

Partnering with Mozilla to improve Firefox’s security

— · /news/mozilla-firefox-security

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic’s Long-Term Benefit Trust appoints Vas Narasimhan to Board of Directors

— · /news/narasimhan-board

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

National security expert Richard Fontaine appointed to Anthropic’s long-term benefit trust

— · /news/national-security-expert-richard-fontaine-appointed-to-anthropic-s-long-term-benefit-trust

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

New offices in Paris and Munich expand Anthropic’s European presence

— · /news/new-offices-in-paris-and-munich-expand-european-presence

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Offering expanded Claude access across all three branches of government

— · /news/offering-expanded-claude-access-across-all-three-branches-of-government

We are removing barriers to government AI adoption by offering Claude for Enterprise and Claude for Government to all three branches of government, including federal civilian execu

Anthropic opens Tokyo office, signs a Memorandum of Cooperation with the Japan AI Safety Institute

— · /news/opening-our-tokyo-office

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Understanding and Addressing AI Harms

— · /news/our-approach-to-understanding-and-addressing-ai-harms

Learn about Anthropic's comprehensive framework for identifying, classifying, and mitigating potential harms from AI systems, ensuring responsible development of advanced AI techno

Our framework for developing safe and trustworthy agents

— · /news/our-framework-for-developing-safe-and-trustworthy-agents

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Statement from Dario Amodei on the Paris AI Action Summit

— · /news/paris-ai-summit

A call for greater focus and urgency

Partnering with Scale to Bring Generative AI to Enterprises

— · /news/partnering-with-scale

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Paul Smith to join Anthropic as Chief Commercial Officer

— · /news/paul-smith-to-join-anthropic

Anthropic will appoint Paul Smith as its first Chief Commercial Officer, who will assume the role later this year.

Thoughts on the US Executive Order, G7 Code of Conduct, and Bletchley Park Summit

— · /news/policy-recap-q4-2023

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Measuring political bias in Claude

— · /news/political-even-handedness

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Preparing for global elections in 2024

— · /news/preparing-for-global-elections-in-2024

In this post, we’ll discuss some of the specific steps we’ve taken to help us detect and mitigate potential misuse of our AI tools in political contexts.

Collaborate with Claude on Projects

— · /news/projects

Claude Pro and Team users can now organize chats into Projects. Projects bring together internal knowledge and chat activity in one place so Claude can be your go-to expert for gen

Prompt engineering for business performance

— · /news/prompt-engineering-for-business-performance

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Prompt engineering for Claude's long context window

— · /news/prompting-long-context

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Protecting the wellbeing of our users

— · /news/protecting-well-being-of-users

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Rahul Patil joins Anthropic as Chief Technology Officer

— · /news/rahul-patil-joins-anthropic

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Reed Hastings appointed to Anthropic’s board of directors

— · /news/reed-hastings

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Reflections on our Responsible Scaling Policy

— · /news/reflections-on-our-responsible-scaling-policy

Last summer we published our first Responsible Scaling Policy (RSP), which focuses on addressing catastrophic safety failures and misuse of frontier models. In adopting this policy

Releasing Claude Instant 1.2

— · /news/releasing-claude-instant-1-2

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Responsible Scaling Policy Version 3.0

— · /news/responsible-scaling-policy-v3

An update to Anthropic's policy to mitigate catastrophic risks from AI

Anthropic partners with Rwandan Government and ALX to bring AI education to hundreds of thousands of learners across Africa

— · /news/rwandan-government-partnership-ai-education

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic and Salesforce expand partnership to bring Claude to regulated industries

— · /news/salesforce-anthropic-expanded-partnership

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Salesforce integrates Anthropic's Claude AI to boost Einstein capabilities

— · /news/salesforce-partnership

Salesforce enhances its Einstein 1 Studio with Anthropic's Claude AI models, now available through Amazon Bedrock. Learn how this integration empowers enterprises to improve effici

Anthropic's AI Export Controls Framework Response

— · /news/securing-america-s-compute-advantage-anthropic-s-position-on-the-diffusion-rule

Anthropic submits detailed recommendations for strengthening US export controls on advanced AI chips and model weights. We advocate for maintaining America's compute advantage, adj

Seoul becomes Anthropic’s third office in Asia-Pacific as we continue our international growth

— · /news/seoul-becomes-third-anthropic-office-in-asia-pacific

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

ServiceNow chooses Claude to power customer apps and increase internal productivity

— · /news/servicenow-anthropic-claude

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

SKT Partnership Announcement

— · /news/skt-partnership-announcement

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Snowflake and Anthropic announce $200 million partnership to bring agentic AI to global enterprises

— · /news/snowflake-anthropic-expanded-partnership

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Statement on the comments from Secretary of War Pete Hegseth

— · /news/statement-comments-secretary-war

Anthropic's response to the Secretary of War and advice for customers

A statement from Dario Amodei on Anthropic's commitment to American AI leadership

— · /news/statement-dario-amodei-american-ai-leadership

A statement from Anthropic CEO, Dario Amodei, on Anthropic’s commitment to advancing America's leadership in building powerful and beneficial AI

Statement from Dario Amodei on our discussions with the Department of War

— · /news/statement-department-of-war

A statement from our CEO on national security uses of AI

Progress from our Frontier Red Team

— · /news/strategic-warning-for-ai-risk-progress-and-insights-from-our-frontier-red-team

In this post, we are sharing what we have learned about the trajectory of potential national security risks from frontier AI models, along with some of our thoughts about challenge

Strengthening our safeguards through collaboration with US CAISI and UK AISI

— · /news/strengthening-our-safeguards-through-collaboration-with-us-caisi-and-uk-aisi

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Sydney will become Anthropic’s fourth office in Asia-Pacific

— · /news/sydney-fourth-office-asia-pacific

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Testing and mitigating elections-related risks

— · /news/testing-and-mitigating-elections-related-risks

This blog provides a snapshot of the work we've done since last summer to test our models for elections-related risks.

Testing our safety defenses with a new bug bounty program

— · /news/testing-our-safety-defenses-with-a-new-bug-bounty-program

Today, we're launching a new bug bounty program to stress-test our latest safety measures, in partnership with HackerOne. Similar to the program we announced last summer, we're cha

Introducing the Anthropic Economic Index

— · /news/the-anthropic-economic-index

Announcement of the new Anthropic Economic Index and description of the new data on AI use in occupations

Introducing The Anthropic Institute

— · /news/the-anthropic-institute

We’re launching The Anthropic Institute, a new effort to confront the most significant challenges that powerful AI will pose to our societies.

The case for targeted regulation

— · /news/the-case-for-targeted-regulation

Increasingly powerful AI systems have the potential to accelerate scientific progress, unlock new medical treatments, and grow the economy. But along with the remarkable new capabi

The Long-Term Benefit Trust

— · /news/the-long-term-benefit-trust

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

A framework for AI development transparency

— · /news/the-need-for-transparency-in-frontier-ai

A targeted approach to increasing transparency in frontier AI development, focusing on safety standards and accountability measures for advanced AI systems.

Third-party testing as a key ingredient of AI policy

— · /news/third-party-testing

We believe that the AI sector needs effective third-party testing for frontier AI systems. Developing a testing regime and associated policy interventions based on the insights of

Thoughts on America’s AI Action Plan

— · /news/thoughts-on-america-s-ai-action-plan

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Dario Amodei’s prepared remarks from the AI Safety Summit on Anthropic’s Responsible Scaling Policy

— · /news/uk-ai-safety-summit

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Updates to Consumer Terms and Privacy Policy

— · /news/updates-to-our-consumer-terms

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Updating our Usage Policy

— · /news/updating-our-usage-policy

We're updating the policies that protect our users and ensure our products and services are used responsibly.

Updating restrictions of sales to unsupported regions

— · /news/updating-restrictions-of-sales-to-unsupported-regions

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

U.S. Elections Readiness

— · /news/us-elections-readiness

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Usage Policy Update

— · /news/usage-policy-update

Updates to our Usage Policy that reflect the growing capabilities and evolving usage of our products

Where things stand with the Department of War

— · /news/where-stand-department-war

A statement from Dario Amodei

Zoom Partnership and Investment in Anthropic

— · /news/zoom-partnership-and-investment

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Research113

What 81,000 people told us about the economics of AI

— · /research/81k-economics

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

A General Language Assistant as a Laboratory for Alignment

— · /research/a-general-language-assistant-as-a-laboratory-for-alignment

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

A Mathematical Framework for Transformer Circuits

— · /research/a-mathematical-framework-for-transformer-circuits

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Agentic Misalignment: How LLMs could be insider threats

— · /research/agentic-misalignment

New research on simulated blackmail, industrial espionage, and other misaligned behaviors in LLMs

How AI assistance impacts the formation of coding skills

— · /research/AI-assistance-coding-skills

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic Education Report: The AI Fluency Index

— · /research/AI-fluency-index

Anthropic's AI Fluency Index measures 11 observable behaviors across thousands of Claude.ai conversations to understand how people develop AI collaboration skills.

Alignment faking in large language models

— · /research/alignment-faking

A paper from Anthropic's Alignment Science team on alignment faking in large language models

Anthropic Economic Index report: Economic primitives

— · /research/anthropic-economic-index-january-2026-report

This report introduces new metrics of AI usage to provide a rich portrait of interactions with Claude in November 2025, just prior to the release of Opus 4.5.

Anthropic Economic Index report: Uneven geographic and enterprise AI adoption

— · /research/anthropic-economic-index-september-2025-report

To study such patterns of early AI adoption, we extend the Anthropic Economic Index along two important dimensions, introducing a geographic analysis of Claude.ai conversations and

Introducing Anthropic Interviewer

— · /research/anthropic-interviewer

What 1,250 professionals told us about working with AI

The assistant axis: situating and stabilizing the character of large language models

— · /research/assistant-axis

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Auditing language models for hidden objectives

— · /research/auditing-hidden-objectives

A collaboration between Anthropic's Alignment Science and Interpretability teams

Automated Alignment Researchers: Using large language models to scale scalable oversight

— · /research/automated-alignment-researchers

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Introducing Bloom: an open source tool for automated behavioral evaluations

— · /research/bloom

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Building AI for cyber defenders

— · /research/building-ai-cyber-defenders

How we've improved Claude's cyber defense skills

Building Effective AI Agents

— · /research/building-effective-agents

Discover how Anthropic approaches the development of reliable AI agents. Learn about our research on agent capabilities, safety considerations, and technical framework for building

Circuits Updates – April 2024

— · /research/circuits-updates-april-2024

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Circuits Updates – August 2024

— · /research/circuits-updates-august-2024

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Circuits Updates – July 2024

— · /research/circuits-updates-july-2024

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Circuits Updates – June 2024

— · /research/circuits-updates-june-2024

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Circuits Updates — May 2023

— · /research/circuits-updates-may-2023

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Circuits Updates – September 2024

— · /research/circuits-updates-sept-2024

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Claude’s Character

— · /research/claude-character

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Clio: Privacy-preserving insights into real-world AI use

— · /research/clio

A blog post describing Anthropic’s new system, Clio, for analyzing how people use AI while maintaining their privacy

Collective Constitutional AI: Aligning a Language Model with Public Input

— · /research/collective-constitutional-ai-aligning-a-language-model-with-public-input

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Confidential Inference via Trusted Virtual Machines

— · /research/confidential-inference-trusted-vms

Announcing a new collaborative research paper on Confidential Inference, a set of tools to improve the security of our model weights and of our users' data

Constitutional AI: Harmlessness from AI Feedback

— · /research/constitutional-ai-harmlessness-from-ai-feedback

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Constitutional Classifiers: Defending against universal jailbreaks

— · /research/constitutional-classifiers

A paper from Anthropic describing a new way to guard LLMs against jailbreaking

Insights on Crosscoder Model Diffing

— · /research/crosscoder-model-diffing

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Decomposing Language Models Into Understandable Components

— · /research/decomposing-language-models-into-understandable-components

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Commitments on model deprecation and preservation

— · /research/deprecation-commitments

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

An update on our model deprecation commitments for Claude Opus 3

— · /research/deprecation-updates-opus-3

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

A “diff” tool for AI: Finding behavioral differences in new models

— · /research/diff-tool

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Discovering Language Model Behaviors with Model-Written Evaluations

— · /research/discovering-language-model-behaviors-with-model-written-evaluations

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Disempowerment patterns in real-world AI usage

— · /research/disempowerment-patterns

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Distributed Representations: Composition & Superposition

— · /research/distributed-representations-composition-superposition

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic Economic Index: Tracking AI's role in the US and global economy

— · /research/economic-index-geography

New research from Anthropic exploring geographic patterns of AI use

Anthropic Economic Index report: Learning curves

— · /research/economic-index-march-2026-report

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

The Anthropic Economic Index report: New building blocks for understanding AI use

— · /research/economic-index-primitives

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Announcing the Anthropic Economic Index Survey

— · /research/economic-index-survey-announcement

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Preparing for AI’s economic impact: exploring policy responses

— · /research/economic-policy-responses

We’ve asked economists and researchers to explore policy responses to the potential economic effects of powerful AI. We share some of the initial ideas and feedback we’ve received.

From shortcuts to sabotage: natural emergent misalignment from reward hacking

— · /research/emergent-misalignment-reward-hacking

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Emotion concepts and their function in a large language model

— · /research/emotion-concepts-function

All modern language models sometimes act like they have emotions. What’s behind these behaviors? Our interpretability team investigates.

Claude Opus 4 and 4.1 can now end a rare subset of conversations

— · /research/end-subset-conversations

An update on our exploratory research on model welfare

The engineering challenges of scaling interpretability

— · /research/engineering-challenges-interpretability

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Estimating AI productivity gains

— · /research/estimating-productivity-gains

Anthropic economic research on productivity gains

Challenges in evaluating AI systems

— · /research/evaluating-ai-systems

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Evaluating and Mitigating Discrimination in Language Model Decisions

— · /research/evaluating-and-mitigating-discrimination-in-language-model-decisions

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Evaluating feature steering: A case study in mitigating social biases

— · /research/evaluating-feature-steering

A new piece of Anthropic research by Durmus et al.: "Evaluating feature steering: A case study in mitigating social biases"

Exploring model welfare

— · /research/exploring-model-welfare

Announcing a new research program at Anthropic on model welfare

Using dictionary learning features as classifiers

— · /research/features-as-classifiers

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Forecasting rare language model behaviors

— · /research/forecasting-rare-behaviors

Anthropic research on predicting rare, undesirable AI behaviors

How AI Is Transforming Work at Anthropic

— · /research/how-ai-is-transforming-work-at-anthropic

How AI Is Transforming Work at Anthropic

How Australia Uses Claude: Findings from the Anthropic Economic Index

— · /research/how-australia-uses-claude

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic Economic Index: AI's impact on software development

— · /research/impact-software-development

Data on how software developers are using Claude

In-context Learning and Induction Heads

— · /research/in-context-learning-and-induction-heads

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

India Country Brief: The Anthropic Economic Index

— · /research/india-brief-economic-index

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Tracing Model Outputs to the Training Data

— · /research/influence-functions

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Interpretability Dreams

— · /research/interpretability-dreams

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Introducing our Science Blog

— · /research/introducing-anthropic-science

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Emergent introspective awareness in large language models

— · /research/introspection

Research from Anthropic on the ability of large language models to introspect

Labor market impacts of AI: A new measure and early evidence

— · /research/labor-market-impacts

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Language Models (Mostly) Know What They Know

— · /research/language-models-mostly-know-what-they-know

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Long-running Claude for scientific computing

— · /research/long-running-Claude

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Many-shot jailbreaking

— · /research/many-shot-jailbreaking

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Mapping the Mind of a Large Language Model

— · /research/mapping-mind-language-model

We have identified how millions of concepts are represented inside Claude Sonnet, one of our deployed large language models. This is the first ever detailed look inside a modern, p

Measuring AI agent autonomy in practice

— · /research/measuring-agent-autonomy

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Measuring Faithfulness in Chain-of-Thought Reasoning

— · /research/measuring-faithfulness-in-chain-of-thought-reasoning

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Measuring the Persuasiveness of Language Models

— · /research/measuring-model-persuasiveness

Anthropic developed a way to test how persuasive language models (LMs) are, and analyzed how persuasiveness scales across different versions of Claude.

Measuring Progress on Scalable Oversight for Large Language Models

— · /research/measuring-progress-on-scalable-oversight-for-large-language-models

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Next-generation Constitutional Classifiers: More efficient protection against universal jailbreaks

— · /research/next-generation-constitutional-classifiers

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Open-sourcing circuit-tracing tools

— · /research/open-source-circuit-tracing

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

The persona selection model

— · /research/persona-selection-model

A theory of why AI models act like humans

Persona vectors: Monitoring and controlling character traits in language models

— · /research/persona-vectors

A paper from Anthropic describing persona vectors and their applications to monitoring and controlling model behavior

Petri: An open-source auditing tool to accelerate AI safety research

— · /research/petri-open-source-auditing

A new automated auditing tool for AI safety research

Predictability and Surprise in Large Generative Models

— · /research/predictability-and-surprise-in-large-generative-models

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Privileged Bases in the Transformer Residual Stream

— · /research/privileged-bases-in-the-transformer-residual-stream

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Simple probes can catch sleeper agents

— · /research/probes-catch-sleeper-agents

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Project Fetch: Can Claude train a robot dog?

— · /research/project-fetch-robot-dog

A practical experiment on AI's ability to affect the physical world

Project Vend: Can Claude run a small shop? (And why does that matter?)

— · /research/project-vend-1

We let Claude run a small shop in the Anthropic office. Here's what happened.

Project Vend: Phase two

— · /research/project-vend-2

How Claude turned around its failing vending machine business

Mitigating the risk of prompt injections in browser use

— · /research/prompt-injection-defenses

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Question Decomposition Improves the Faithfulness of Model-Generated Reasoning

— · /research/question-decomposition-improves-the-faithfulness-of-model-generated-reasoning

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Reasoning models don't always say what they think

— · /research/reasoning-models-dont-say-think

Research from Anthropic on the faithfulness of AI models' Chain-of-Thought

Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned

— · /research/red-teaming-language-models-to-reduce-harms-methods-scaling-behaviors-and-lessons-learned

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Sycophancy to subterfuge: Investigating reward tampering in language models

— · /research/reward-tampering

Empirical evidence that serious misalignment can emerge from seemingly benign reward misspecification.

Sabotage evaluations for frontier models

— · /research/sabotage-evaluations

A new paper on AI safety evaluations from Anthropic's Alignment Science team

Scaling Laws and Interpretability of Learning from Repeated Data

— · /research/scaling-laws-and-interpretability-of-learning-from-repeated-data

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

SHADE-Arena: Evaluating Sabotage and Monitoring in LLM Agents

— · /research/shade-arena-sabotage-monitoring

A new set of evaluations to test the sabotage and monitoring capabilities of LLM agents

Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training

— · /research/sleeper-agents-training-deceptive-llms-that-persist-through-safety-training

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

A small number of samples can poison LLMs of any size

— · /research/small-samples-poison

Anthropic research on data-poisoning attacks in large language models

Softmax Linear Units

— · /research/softmax-linear-units

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Specific versus General Principles for Constitutional AI

— · /research/specific-versus-general-principles-for-constitutional-ai

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

A statistical approach to model evaluations

— · /research/statistical-approach-to-model-evals

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Studying Large Language Model Generalization with Influence Functions

— · /research/studying-large-language-model-generalization-with-influence-functions

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Superposition, Memorization, and Double Descent

— · /research/superposition-memorization-and-double-descent

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Claude SWE-Bench Performance

— · /research/swe-bench-sonnet

Explore Claude's breakthrough performance on SWE-Bench, demonstrating advanced software engineering capabilities and code generation accuracy. Learn about our technical evaluation

Alignment Research

— · /research/alignment

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Economic Research

— · /research/economic-research

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Interpretability Research

— · /research/interpretability

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Societal Impacts Research

— · /research/societal-impacts

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

The Capacity for Moral Self-Correction in Large Language Models

— · /research/the-capacity-for-moral-self-correction-in-large-language-models

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Towards Measuring the Representation of Subjective Global Opinions in Language Models

— · /research/towards-measuring-the-representation-of-subjective-global-opinions-in-language-models

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Towards Monosemanticity: Decomposing Language Models With Dictionary Learning

— · /research/towards-monosemanticity-decomposing-language-models-with-dictionary-learning

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Towards Understanding Sycophancy in Language Models

— · /research/towards-understanding-sycophancy-in-language-models

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Toy Models of Superposition

— · /research/toy-models-of-superposition

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Tracing the thoughts of a large language model

— · /research/tracing-thoughts-language-model

Anthropic's latest interpretability research: a new microscope to understand Claude's internal mechanisms

Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

— · /research/training-a-helpful-and-harmless-assistant-with-reinforcement-learning-from-human-feedback

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Reflections on Qualitative Research

— · /research/transformer-circuits

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Trustworthy agents in practice

— · /research/trustworthy-agents

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Values in the wild: Discovering and analyzing values in real-world language model interactions

— · /research/values-wild

An Anthropic research paper testing which values AI models express in the real world

Vibe physics: The AI grad student

— · /research/vibe-physics

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Claude's extended thinking

— · /research/visible-extended-thinking

Discussing Claude's new thought process

Engineering 24