AI News
  • Home
  • AI & Tech
  • Machine Learning
  • Startups
  • Tools & Apps
  • Robotics
  • Future Tech
  • AI in Industry
    • AI in Sport ⚽
    • AI in Health
    • AI in Education
    • AI in Finance
    • AI in Business
    • AI in Law
    • AI in Climate
No Result
View All Result
SAVED POSTS
AI News
  • Home
  • AI & Tech
  • Machine Learning
  • Startups
  • Tools & Apps
  • Robotics
  • Future Tech
  • AI in Industry
    • AI in Sport ⚽
    • AI in Health
    • AI in Education
    • AI in Finance
    • AI in Business
    • AI in Law
    • AI in Climate
No Result
View All Result
AI News
No Result
View All Result

Anthropic says its new model uses more AI to shape its own character

Ramo by Ramo
27 June 2026
in AI & Tech
401 21
0
Anthropic says its new model uses more AI to shape its own character
585
SHARES
3.2k
VIEWS
Summarize with ChatGPTShare to Facebook
Anthropic says its new model uses more AI to shape its own character

Anthropic, the AI company behind the Claude family of large language models, is publicly releasing a new method for tuning model behavior that it calls Character Shaping. Unlike the standard fine-tuning process where human trainers manually label ideal outputs, this technique leans on a separate AI model to steer the behavior of the primary model during a specialized training phase.

How Character Shaping works in practice

The company explains that Character Shaping operates as a layer on top of the standard reinforcement learning from human feedback, or RLHF, pipeline. In typical RLHF, human raters compare responses from the model and create a reward signal that teaches the model which types of answers are preferred. Anthropic found that this process can be made more flexible by introducing a second, smaller AI model that acts as a kind of guide. This guide model, which itself has been trained on a set of principles or personality traits selected by the developer, scores the primary model’s responses during training. The primary model then adjusts its behavior to maximize the scores given by this guide model.

The key insight Anthropic is promoting is that this guide model can be updated or swapped out without having to retrain the entire large language model from scratch. A developer could, for example, create a guide model that prioritizes concise answers, then swap it for one that rewards more verbose explanations, and the main model would adapt accordingly after a relatively small amount of additional training. Anthropic claims this approach reduces the time and cost associated with repeatedly collecting human preference data every time a company wants to tweak the tone or style of its chatbot.

📖
RECOMMENDED READ
The Coming Wave: AI, Power, and the Greatest Dilemma of Our Age
Mustafa Suleyman
The definitive book on where AI is heading - written by one of the field founders.
View on Amazon →affiliate link

Potential applications and developer control

Early tests from Anthropic suggest that Character Shaping produces noticeable differences in how Claude responds to user prompts. When the guide model was optimized for thoughtfulness, the primary model tended to produce more detailed and cautious answers. When the guide model was optimized for speed, the primary model gave shorter and more direct responses. The company states that the effect is consistent across a range of common queries, though it cautions that the technique does not replace the broader safety training that prevents harmful outputs. Character Shaping is intended as a tool for customizing the user experience, not for bypassing the core safety constraints that all Claude models share.

Anthropic has positioned this release as a way to give businesses and developers more granular control over the AI assistants they deploy. A customer service bot, for instance, could be shaped to be extremely polite and deferential, while a coding assistant could be shaped to be direct and terse. The company argues that this level of customization was previously only possible through extensive prompt engineering or large-scale custom fine-tuning, both of which can be brittle or expensive. Character Shaping, Anthropic says, offers a middle ground that is both more robust than prompt tricks and far cheaper than full retraining.

The company has published a technical overview of the method and is making the technique available through its API. Researchers outside of Anthropic will be able to experiment with the guide model approach and provide feedback on its strengths and limitations. Anthropic hopes this transparency will help the broader AI community develop better standards for controlling model behavior in a predictable and cost effective manner.

Character Shaping is an interesting step toward more modular AI control. Instead of treating the model’s personality as a fixed trait determined during the initial training run, Anthropic is showing that you can build a dial that developers can turn after the fact. The long term vision appears to be a world where different parts of a system have their own small shaper models, each responsible for a different facet of behavior, working together to produce a coherent but adaptable assistant. This is the kind of architectural thinking that could make future AI systems more manageable for the teams that build them and more useful for the people who interact with them. For more insights on how AI training methods are evolving, check out our coverage at {$link_text}.

Tags: AI trainingAnthropicCharacter ShapingClaudeRLHF
SummarizeShare234
Ramo

Ramo

Ramo is the editorial voice of Mylistingo — an AI and technology news platform based in The Hague, Netherlands. Covering artificial intelligence, machine learning, robotics, and the future of technology, Ramo delivers accurate, accessible reporting for both general audiences and industry professionals. Every article is fact-checked and written to meet Mylistingo's strict no-fabrication editorial standards.

Related Stories

Meta’s AI supercomputer rivals top 10 globally, aims for AGI

Meta’s AI supercomputer rivals top 10 globally, aims for AGI

by Ramo
27 June 2026
0

Meta unveils an AI supercomputer now among the world’s ten most powerful, built from scratch to train large models and push toward general intelligence.

ai coding competition shakes up developer hiring

ai coding competition shakes up developer hiring

by Ramo
26 June 2026
0

GitHub and Hugging Face launch a new AI coding competition to benchmark models and challenge how developers are hired in the industry.

how AI is rewriting the rules of content creation at scale

how AI is rewriting the rules of content creation at scale

by Ramo
26 June 2026
0

AI is transforming content production from drafting to distribution. This article explores the new tools and strategies shaping the future of digital media.

OpenAI says its new model is capable of reason

OpenAI says its new model is capable of reason

by Ramo
26 June 2026
0

OpenAI unveils Orion, a new AI model that can reason through complex problems, marking a shift from simple prediction to deeper logical thinking.

Recommended

Smartphone addiction among the young is undeniably real — Photo by cottonbro studio on Pexels

Smartphone addiction among the young is undeniably real

22 June 2026
SpaceX SPV investors won’t know their true holdings until post-IPO lock-ups lift — Photo by SpaceX on Pexels

SpaceX SPV investors won’t know their true holdings until post-IPO lock-ups lift

22 June 2026

Popular Story

  • How I Developed a Trading Indicator That Boasts Over 350% Returns—and How to Get It for Free — Photo by Саша Алалыкин on Pexels

    How I Developed a Trading Indicator That Boasts Over 350% Returns—and How to Get It for Free

    37 shares
    Share 477 Tweet 298
  • Is Your Home Truly Safe The Smart Security Tech You Need in 2025

    587 shares
    Share 235 Tweet 147
  • AI Takes the Field: Strikes, Horses, and the NBA Draft

    587 shares
    Share 235 Tweet 147
  • OpenAI unveils Lockdown Mode to protect sensitive data from prompt injection attacks

    587 shares
    Share 235 Tweet 147
  • How AI Is Changing Sports Coaching in 2026

    586 shares
    Share 234 Tweet 147
Mylstingo

We bring you the best Premium WordPress Themes that perfect for news, magazine, personal blog, etc. Check our landing page for details.

Recent Posts

  • Could Israel’s coming election see an end to Netanyahu’s political career?
  • Ronaldo, Portugal play Colombia in World Cup: Prediction, kickoff, schedule
  • No school, living in a tent, but it’s exam time in Gaza

Categories

  • AI & Tech
  • AI in Business
  • AI in Climate
  • AI in Education
  • AI in Finance
  • AI in Health
  • AI in Law
  • AI in Sport
  • Future Tech
  • Machine Learning
  • Robotics
  • Startups
  • Tools & Apps
  • Uncategorized

Weekly Newsletter

  • Home
  • Latest News
  • Contact Us
  • Data Deletion Instructions
  • Editorial Policy

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • Home
  • AI & Tech
  • Machine Learning
  • Startups
  • Tools & Apps
  • Robotics
  • Future Tech
  • AI in Industry
    • AI in Sport ⚽
    • AI in Health
    • AI in Education
    • AI in Finance
    • AI in Business
    • AI in Law
    • AI in Climate