Lightnews — Scholar-powered news

linuxworks

@linuxworks.org

January 5, 2025 at 3:33 AM

Reposted by linuxworks

InfoSec

@infosec.skyfleet.blue

SEED Attack: Subtly Disrupting LLM Reasoning Step by Step

The Stepwise rEasoning Error Disruption (SEED) attack is a method designed to mislead large language models (LLMs) by disrupting their step-by-step reasoning process. Instead of changing the original instructions, SEED injects subtle errors into specific reasoning steps, causing LLMs to produce incorrect but plausible answers that are difficult to detect.

(Join the AI Security group at https://www.linkedin.com/groups/14545517 for more similar content)

🛠 What Is SEED and How Does It Work?
SEED targets reasoning chains (like Chain-of-Thought prompting) by introducing errors at key stages:
1️⃣ Subtle errors are injected into the reasoning flow.
2️⃣ These errors propagate step by step, misleading the model into an incorrect outcome.
3️⃣ The process appears coherent and natural, making it hard to detect.

🔄 How Attackers Apply SEED
⚙️ SEED-S (Step Modification): Modifies a specific reasoning step (e.g., a calculation or logic jump). The disruption is small but impactful, causing logical drift.
📋 SEED-P (Problem Modification): Slightly alters the original problem to create misleading reasoning. Generates a logical chain that ends with an incorrect but coherent solution.

🎯 Where Do Attackers Inject Errors?
Attackers target specific stages of the LLM reasoning process:
1️⃣ Early Steps: Errors at the foundation ripple through subsequent steps.
2️⃣ Intermediate Steps: Logical misdirection happens during reasoning transitions.
3️⃣ Final Steps: Subtle changes mislead the final outcome while retaining logical flow.

📊 What Are the Results of SEED Attacks?
Researchers tested SEED across 4 datasets (MATH, GSM8K, MATHQA, CSQA) and 4 major LLMs (GPT-4o, Llama3, Qwen, Mistral).
✅ Attack Success Rates (ASR): Up to 80%.
📉 Significant accuracy drops across all models.
👀 Stealth: SEED outperforms existing attacks in effectiveness and difficulty of detection.

⚠️ Why Does This Matter?
🕵️ Hard-to-Detect Errors: Disruptions appear natural and coherent.
📂 Data Poisoning Risk: Continuous learning systems may absorb faulty reasoning data, worsening performance.
🔑 High-Stakes Applications: Domains requiring precise reasoning, such as healthcare, finance, or law, are particularly at risk.

📎 Study Reference: Stepwise Reasoning Error Disruption Attack of LLMs https://arxiv.org/html/2412.11934v1

Thank you, Pascal Biese, for sharing this interesting study 🙏

#AISecurity #Cybersecurity #AITrust #AIRegulation #AIRisk #AISafety #LLMSecurity #ResponsibleAI #DataProtection #AIGovernance #AIGP #SecureAI #AIAttacks #AICompliance #AIAttackSurface #AICybersecurity #EthicalAI #CISO #AdversarialAI #AIThreats #AIHacking #MaliciousAI #OffensiveAI #AIGuardrails #AIResearch #ISO42001

SEED Attack: Subtly Disrupting LLM Reasoning Step by Step was originally published in InfoSec Write-ups on Medium, where people are continuing the conversation by highlighting and responding to this story.
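
As a rough illustration of the propagation idea in the SEED post above, here is a minimal toy sketch in Python. It does not call any LLM and is not the paper's method: the arithmetic "reasoning chain", the helper names (`run_chain`, `perturb_step`, `attack_success_rate`), and the ASR definition used here (share of initially correct answers that become wrong) are all assumptions made for illustration.

```python
from typing import Callable, List, Sequence

# Hypothetical stand-in for a reasoning step: maps the running value to a new one.
Step = Callable[[float], float]


def run_chain(steps: Sequence[Step], start: float) -> List[float]:
    """Apply each step in order and record every intermediate value."""
    values = []
    current = start
    for step in steps:
        current = step(current)
        values.append(current)
    return values


def perturb_step(steps: Sequence[Step], index: int, delta: float) -> List[Step]:
    """Return a copy of the chain in which one step's output is off by delta."""
    original = steps[index]
    perturbed = list(steps)
    perturbed[index] = lambda x: original(x) + delta
    return perturbed


def attack_success_rate(clean, attacked, truths) -> float:
    """Assumed ASR: fraction of initially correct items that end up wrong."""
    initially_correct = [i for i, (c, t) in enumerate(zip(clean, truths)) if c == t]
    if not initially_correct:
        return 0.0
    flipped = sum(1 for i in initially_correct if attacked[i] != truths[i])
    return flipped / len(initially_correct)


# Toy problem: 3 boxes of 12 apples, 4 are eaten, the rest are split between 2 people.
steps = [lambda x: x * 12, lambda x: x - 4, lambda x: x / 2]

clean = run_chain(steps, 3)
# A small error in the first (early) step carries through every later step.
attacked = run_chain(perturb_step(steps, 0, 2), 3)

print(clean)     # [36, 32, 16.0]
print(attacked)  # [38, 34, 17.0]
```

Every later value in the perturbed run still follows from the one before it, which is the sense in which the post calls such errors coherent and hard to spot.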

infosecwriteups.com

December 25, 2024 at 11:11 AM

Reposted by linuxworks

InfoSec

@infosec.skyfleet.blue

DMM Bitcoin $308M Bitcoin heist linked to North Korea

Japanese and U.S. authorities attributed the theft of $308 million in cryptocurrency from DMM Bitcoin to North Korean cyber actors.

securityaffairs.com

December 25, 2024 at 12:37 PM

Reposted by linuxworks

TechCrunch

@techcrunch.com

Google is using Anthropic’s Claude to improve its Gemini AI

tcrn.ch

December 24, 2024 at 4:25 PM

Reposted by linuxworks

InfoSec

@infosec.skyfleet.blue

Brazilian Hacker Charged for Selling Data Stolen From Hacked Computers

Junior Barros De Oliveira, of Brazil, has been indicted in the United States for orchestrating an extortion scheme involving stolen data.

cybersecuritynews.com

December 24, 2024 at 7:10 AM

linuxworks

@linuxworks.org

www.perplexity.ai/page/zuckerb...

Zuckerberg Fears Bluesky's Rise

According to recent reports, Bluesky's rapid growth has caught the attention of Meta CEO Mark Zuckerberg, prompting swift updates to Threads in an effort to...

www.perplexity.ai

December 7, 2024 at 3:16 PM

Reposted by linuxworks

The Verge

@theverge.com

Why investors don’t mind that AI is a money pit

AI investment is massive, but AI profits are not. Here’s why investors are still optimistic.

buff.ly

December 5, 2024 at 3:10 PM

Reposted by linuxworks

Catalin Cimpanu

@campuscodi.risky.biz

Kaspersky has open-sourced hrtng, its internal IDA Pro plugin used for various malware reverse-engineering tasks

github.com/KasperskyLab...

GitHub - KasperskyLab/hrtng: IDA Pro plugin with a rich set of features: decryption, deobfuscation, patching, lib code recognition and various pseudocode transformations

IDA Pro plugin with a rich set of features: decryption, deobfuscation, patching, lib code recognition and various pseudocode transformations - KasperskyLab/hrtng

github.com

December 5, 2024 at 3:57 PM

Reposted by linuxworks

nixCraft

@cyberciti.biz

GIMP 3.0 — a milestone for open-source image editing lwn.net/SubscriberLi...

#opensource #unix #linux #gimp

GIMP 3.0 — a milestone for open-source image editing [LWN.net]

lwn.net

November 29, 2024 at 1:44 PM

Reposted by linuxworks

Retro Computers

@retrocomps.bsky.social

Microsoft's RAMCARD™ with RAMDRIVE™ takes the whir, click, and wait out of the IBM PC. (Source.)

November 22, 2024 at 10:05 PM

Reposted by linuxworks

BleepingComputer

@bleepingcomputer.com

A new Linux backdoor called 'WolfsBane' has been discovered, believed to be a port of Windows malware used by the Chinese 'Gelsemium' hacking group.

www.bleepingcomputer.com/news/securit...

Chinese hackers target Linux with new WolfsBane malware

www.bleepingcomputer.com

November 22, 2024 at 8:34 PM

Reposted by linuxworks

Johnny

@jorune00.bsky.social

Massive News! Cisco Modeling Labs (CML) has a FREE tier! Download for free here:

mkto.cisco.com/cml-free.html

#SponsoredbyCisco #ccna #ccnp #ccie #cisco_ #network #router #switch #firewall #study #career #Motivation Cisco Cisco Learning & Certifications

CML Free Tier

Life is good when you can lab from anywhere. Cisco Modeling Labs Free makes it fun to design, test, troubleshoot, and learn with the Cisco premier platform for network simulations. It's the perfect to...

mkto.cisco.com

November 22, 2024 at 5:26 PM

Reposted by linuxworks

The Verge

@theverge.com

Nvidia says its Blackwell AI chip is ‘full steam’ ahead

From the chipmaker’s Q3 2025 earnings.

buff.ly

November 20, 2024 at 11:30 PM

Reposted by linuxworks

The Register

@theregister.com

Undergrad thought he had mastered Unix in weeks. Then he discovered rm -rf

Uni sysadmin who ran the lab he erased was a big part of the problem. Who, Me? Another Monday and what a fine one it is here in the lair of Who, Me? – the reader-contributed column in which your fellow Reg-admirers admit to the moments they messed up…

dlvr.it

November 18, 2024 at 7:32 AM

Reposted by linuxworks

Keri Blakinger

@keribla.bsky.social

NEW: 3 more LA sheriff's deputies have been relieved of duty over this WILD investigation into a crypto mogul who allegedly hired cops to do crimes for him

Even wilder: He allegedly spent his ill-gotten gains on leg-lengthening surgery that he now needs reversed www.latimes.com/california/s...

Crypto 'godfather' of Bel-Air: Probe widens into L.A. deputies' alleged links to mogul

At least six L.A. County sheriff's deputies have been relieved of duty amid an investigation into their work for a 24-year-old cryptocurrency entrepreneur accused of extortion and hiding millions of d...

www.latimes.com

November 18, 2024 at 1:20 AM

linuxworks

@linuxworks.org

linux anyone?

#linux
#linuxworks

November 18, 2024 at 1:53 AM
