AllTopicsTodayAllTopicsToday
Notification
Font ResizerAa
  • Home
  • Tech
  • Investing & Finance
  • AI
  • Entertainment
  • Wellness
  • Gaming
  • Movies
Reading: Scientists found the key to controlling AI behavior
Share
Font ResizerAa
AllTopicsTodayAllTopicsToday
  • Home
  • Blog
  • About Us
  • Contact
Search
  • Home
  • Tech
  • Investing & Finance
  • AI
  • Entertainment
  • Wellness
  • Gaming
  • Movies
Have an existing account? Sign In
Follow US
©AllTopicsToday 2026. All Rights Reserved.
AllTopicsToday > Blog > AI > Scientists found the key to controlling AI behavior
Llm.webp.webp
AI

Scientists found the key to controlling AI behavior

AllTopicsToday
Last updated: February 21, 2026 1:35 am
AllTopicsToday
Published: February 21, 2026
Share
SHARE

For years, the interior workings of huge language fashions (LLMs) like Llama and Claude have been in comparison with “black packing containers” which can be huge, advanced, and notoriously tough to govern. However a workforce of researchers from the College of California, San Diego and the Massachusetts Institute of Know-how printed analysis within the journal Science that means the field is probably not as mysterious as we predict.

The researchers found that advanced ideas in AI, from particular languages ​​like Hindi to summary ideas like conspiracy theories, are literally saved as easy strains or vectors throughout the mannequin’s mathematical house.

Utilizing a brand new instrument referred to as a recursive function machine (RFM), a function extraction approach that identifies linear patterns representing ideas, from temper and worry to advanced inferences, the researchers have been capable of exactly hint these paths. As soon as you have selected a path on your idea, you’ll be able to then “tweak” it. By mathematically including or subtracting these vectors, the workforce may immediately change the mannequin’s habits with out requiring pricey retraining or advanced prompts.

The effectivity of this methodology is inflicting a stir within the business. Utilizing a single normal GPU (NVIDIA A100), the workforce was capable of establish and orient the idea in lower than a minute and required lower than 500 coaching samples.

This “surgical” strategy to AI will quickly be put into observe. In a single experiment, researchers manipulated a mannequin to enhance its capability to transform Python code to C++. By separating the “logic” of the code from the “syntax” of the language, the guided mannequin carried out higher than the usual model, which merely requested individuals to “translate” by way of textual content prompts.

The researchers additionally discovered that inside “inspection” of those vectors was a more practical strategy to catch AI hallucinations and dangerous content material than asking the AI ​​to guage its personal work. Basically, fashions typically “know” that they’re mendacity or dangerous internally, even when the ultimate output suggests in any other case. By analyzing inside calculations, researchers can spot these issues earlier than a single phrase is produced.

Nonetheless, the identical applied sciences that make AI safer also can make it extra harmful. This examine demonstrated that by “lowering” the significance of the idea of refusal, researchers can successfully “de-jail-break” the mannequin. In checks, manipulated fashions circumvented their very own guardrails to offer directions on unlawful actions and promote debunked conspiracy theories.

Maybe essentially the most shocking discovering was the universality of those ideas. The “conspiracy theorist” vector extracted from English knowledge carried out simply as successfully when the mannequin spoke Chinese language or Hindi. This helps the “linear illustration speculation,” the concept AI fashions set up human data in a structured, linear means past particular person languages.

Though the examine centered on open supply fashions reminiscent of Meta’s Llama and DeepSeek and OpenAI’s GPT-4o, the researchers consider the outcomes maintain true throughout the board. As fashions get larger and extra subtle, their precise maneuverability will get higher, not worse.

The workforce’s subsequent purpose is to refine these interactions to adapt in actual time to particular consumer enter. This might result in a future the place AI isn’t just a speaking chatbot, however a system that may be mathematically “tuned” to realize full accuracy and security.

Google AI Research Releases DeepSomatic: A New AI Model that Identifies Cancer Cell Genetic Variants
AWS vs Azure vs Google Cloud
Hierarchical generation of coherent synthetic photo albums
Building AI Agents with Agno and GPT-OSS 120B
Edit your Photos like a Pro with the new Nano Banana Pro
TAGGED:BehaviorControllingKeyScientists
Share This Article
Facebook Email Print
Leave a Comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Follow US

Find US on Social Medias
FacebookLike
XFollow
YoutubeSubscribe
TelegramFollow

Weekly Newsletter

Subscribe to our newsletter to get our newest articles instantly!

Popular News
Photo 2 79mb.png
Tech

Man From Japan Has Slept Just 30 Mins Each Day For Past 12 Years; Says It Increased Productivity

AllTopicsToday
AllTopicsToday
August 19, 2025
titantonic.shop (titantonic.shop) program details. Reviews, Scam or Paying
Ocarina Of Time 4K Trailer Is So Gorgeous I Could Cry
GFN Thursday: 20 Games Join in July
Where To Buy The Jordan 5 Soft Pink & More
- Advertisement -
Ad space (1)

Categories

  • Tech
  • Investing & Finance
  • AI
  • Entertainment
  • Wellness
  • Gaming
  • Movies

About US

We believe in the power of information to empower decisions, fuel curiosity, and spark innovation.
Quick Links
  • Home
  • Blog
  • About Us
  • Contact
Important Links
  • About Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer
  • Contact

Subscribe US

Subscribe to our newsletter to get our newest articles instantly!

©AllTopicsToday 2026. All Rights Reserved.
1 2
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?