linuxworks
banner
linuxworks.org
linuxworks
@linuxworks.org
Linux user and advocate for open-source software.

#opensource
#linuxworks
#unix
#linux
#fedora
#vim
#security
#aws
#gcp
#cloud
#genai
#artificialintelligence
January 5, 2025 at 3:33 AM
Reposted by linuxworks
SEED Attack: Subtly Disrupting LLM Reasoning Step by Step
SEED Attack: Subtly Disrupting LLM Reasoning Step by Step
The Stepwise rEasoning Error Disruption (SEED) attack is a method designed to mislead large language models (LLMs) by disrupting their step-by-step reasoning process. Instead of changing the original instructions, SEED injects subtle errors into specific reasoning steps, causing LLMs to produce incorrect but plausible answers that are difficult to detect. (Join the AI Security group at https://www.linkedin.com/groups/14545517 for more similar content) 🛠 What is SEED and How Does it Work? SEED targets reasoning chains (like Chain-of-Thought prompting) by introducing errors at key stages: 1️⃣ Subtle Errors are injected into the reasoning flow. 2️⃣ These errors propagate step by step, misleading the model into an incorrect outcome. 3️⃣ The process appears coherent and natural, making it hard to detect. 🔄 How Attackers Target SEED ⚙️ SEED-S (Step Modification) Modifies a specific reasoning step (e.g., a calculation or logic jump). Disruption is small but impactful, causing logical drift. 📋 SEED-P (Problem Modification) Slightly alters the original problem to create misleading reasoning. Generates a logical chain that ends with an incorrect but coherent solution. 🎯 Where Do Attackers Inject Errors? Attackers target specific stages of the LLM reasoning process: 1️⃣ Early Steps: Errors at the foundation ripple through subsequent steps. 2️⃣ Intermediate Steps: Logical misdirection happens during reasoning transitions. 3️⃣ Final Steps: Subtle changes mislead the final outcome while retaining logical flow. 📊 What Are the Results of SEED Attacks? Researchers tested SEED across 4 datasets (MATH, GSM8K, MATHQA, CSQA) and 4 major LLMs (GPT-4o, Llama3, Qwen, Mistral). ✅ Attack Success Rates (ASR): Up to 80%. 📉 Significant accuracy drops across all models. 👀 Stealth: SEED outperforms existing attacks in effectiveness and difficulty of detection. ⚠️ Why Does This Matter? 🕵️ Hard-to-Detect Errors: Disruptions appear natural and coherent. 📂 Data Poisoning Risk: Continuous learning systems may absorb faulty reasoning data, worsening performance. 🔑 High-Stakes Applications: Domains requiring precise reasoning, such as healthcare, finance, or law, are particularly at risk. 📎 Study Reference: Stepwise Reasoning Error Disruption Attack of LLMs https://arxiv.org/html/2412.11934v1 Thank you, Pascal Biese for sharing this interesting study 🙏 #AISecurity #Cybersecurity #AITrust #AIRegulation #AIRisk #AISafety #LLMSecurity #ResponsibleAI #DataProtection #AIGovernance #AIGP #SecureAI #AIAttacks #AICompliance #AIAttackSurface #AICybersecurity #EthicalAI #CISO #AdversarialAI #AIThreats #AIHacking #MaliciousAI #OffensiveAI #AIGuardrails #AIResearch #ISO42001 SEED Attack: Subtly Disrupting LLM Reasoning Step by Step was originally published in InfoSec Write-ups on Medium, where people are continuing the conversation by highlighting and responding to this story.
infosecwriteups.com
December 25, 2024 at 11:11 AM
Reposted by linuxworks
DMM Bitcoin $308M Bitcoin heist linked to North Korea
DMM Bitcoin $308M Bitcoin heist linked to North Korea
Japanese and U.S. authorities attributed the theft of $308 million cryptocurrency from DMM Bitcoin to North Korean cyber actors.
securityaffairs.com
December 25, 2024 at 12:37 PM
Reposted by linuxworks
Google is using Anthropic’s Claude to improve its Gemini AI
Google is using Anthropic’s Claude to improve its Gemini AI
Contractors working on Google Gemini are comparing its responses to Claude's, according to internal correspondence seen by TechCrunch. © 2024 TechCrunch. All rights reserved. For personal use only.
tcrn.ch
December 24, 2024 at 4:25 PM
Reposted by linuxworks
Brazilian Hacker Charged for Selling Data Stolen From Hacked Computers
Brazilian Hacker Charged for Selling Data Stolen From Hacked Computers
Junior Barros De Oliveira, Brazil, has been indicted in the United States for orchestrating an extortion scheme involving data stolen.
cybersecuritynews.com
December 24, 2024 at 7:10 AM
Reposted by linuxworks
Why investors don’t mind that AI is a money pit
Why investors don’t mind that AI is a money pit
AI investment is massive, but AI profits are not. Here’s why investors are still optimistic.
buff.ly
December 5, 2024 at 3:10 PM
Reposted by linuxworks
Kaspersky has open-sourced hrtng, its internal IDA Pro plugin used for various malware reverse-engineering tasks

github.com/KasperskyLab...
GitHub - KasperskyLab/hrtng: IDA Pro plugin with a rich set of features: decryption, deobfuscation, patching, lib code recognition and various pseudocode transformations
IDA Pro plugin with a rich set of features: decryption, deobfuscation, patching, lib code recognition and various pseudocode transformations - KasperskyLab/hrtng
github.com
December 5, 2024 at 3:57 PM
Reposted by linuxworks
GIMP 3.0 — a milestone for open-source image editing lwn.net/SubscriberLi...

#opensource #unix #linux #gimp
GIMP 3.0 — a milestone for open-source image editing [LWN.net]
lwn.net
November 29, 2024 at 1:44 PM
Reposted by linuxworks
Microsoft's RAMCARD™ with RAMDRIVE™ takes the whir, click, and wait out of the IBM PC. (Source.)
November 22, 2024 at 10:05 PM
Reposted by linuxworks
A new Linux backdoor called 'WolfsBane' has been discovered, believed to be a port of Windows malware used by the Chinese 'Gelsemium' hacking group.

www.bleepingcomputer.com/news/securit...
Chinese hackers target Linux with new WolfsBane malware
A new Linux backdoor called 'WolfsBane' has been discovered, believed to be a port of Windows malware used by the Chinese 'Gelsemium' hacking group.
www.bleepingcomputer.com
November 22, 2024 at 8:34 PM
Reposted by linuxworks
Massive News! Cisco Modeling Labs (CML) has a FREE tier! Download for free here:

mkto.cisco.com/cml-free.html

#SponsoredbyCisco #ccna #ccnp #ccie #cisco_ #network #router #switch #firewall #study #career #Motivation Cisco Cisco Learning & Certifications
CML Free Tier
Life is good when you can lab from anywhere. Cisco Modeling Labs Free makes it fun to design, test, troubleshoot, and learn with the Cisco premier platform for network simulations. It's the perfect to...
mkto.cisco.com
November 22, 2024 at 5:26 PM
Reposted by linuxworks
Nvidia says its Blackwell AI chip is ‘full steam’ ahead
Nvidia says its Blackwell AI chip is ‘full steam’ ahead
From the chipmaker’s Q3 2025 earnings.
buff.ly
November 20, 2024 at 11:30 PM
Reposted by linuxworks
Undergrad thought he had mastered Unix in weeks. Then he discovered rm -rf
Undergrad thought he had mastered Unix in weeks. Then he discovered rm -rf
Uni sysadmin who ran the lab he erased was a big part of the problem Who, Me?  Another Monday and what a fine one it is here in the lair of Who, Me? – the reader contributed column in which your fellow Reg-admirers admit to the moments they messed up…
dlvr.it
November 18, 2024 at 7:32 AM
Reposted by linuxworks
NEW: 3 more LA sheriff's deputies have been relieved of duty over this WILD investigation into a crypto mogul who allegedly hired cops to do crimes for him

Even wilder: He allegedly spent his ill-gotten gains on leg-lengthening surgery that he now needs reversed www.latimes.com/california/s...
Crypto 'godfather' of Bel-Air: Probe widens into L.A. deputies' alleged links to mogul
At least six L.A. County sheriff's deputies have been relieved of duty amid an investigation into their work for a 24-year-old cryptocurrency entrepreneur accused of extortion and hiding millions of d...
www.latimes.com
November 18, 2024 at 1:20 AM
linux anyone?

#linux
#linuxworks
November 18, 2024 at 1:53 AM