Discover AI
banner
ai4you.bsky.social
Discover AI
@ai4you.bsky.social
I love complex artificial intelligence systems. Every day a new surprise. Every day up to 800 new research pre-prints to design new systems, from medical to financial to social AI. I do believe, that science should be open to everybody.
DeepSeek has released a new version of R1, called R1 0528.

No R2, but an improved R1.
I already tested it on complex reasoning, and this open source model is impressive.

youtu.be/toailYMTAKo?...
NEW DeepSeek R1 0528 vs o3 = WOW!
YouTube video by Discover AI
youtu.be
May 29, 2025 at 2:21 PM
New video on creating a new app - for my AI research papers.

No cursor, no windsurf, no bolt, no lovable, no replit.

Google has a new service: try it for free in AI Studio.

Code your app in 15 minutes, from scratch:

youtu.be/x7zrS6xXmgM?...
Vibe Coding PROFESSIONAL: Build App in 15 min ($0)
YouTube video by Discover AI
youtu.be
May 25, 2025 at 3:32 PM
New Qwen 3 30B MoE A3B model is really interesting.

I performed a cascading logic test on it, in thinking and non- thinking mode.

Live recording
youtu.be/u-WXyeV1tsw?...
Surprising Performance of SMALL Qwen3-A3B MoE
YouTube video by Discover AI
youtu.be
April 29, 2025 at 8:57 PM
Current AI methods like SFT or RL are not optimized for quantum computing.

New Quantum AI Framework has been published. AI comes closer to theoretical physics.

Further insights here
youtu.be/vIb3B0PIklE?...
Quantum AI: New Framework
YouTube video by Discover AI
youtu.be
April 27, 2025 at 2:10 PM
If you want to learn HOW to CODE agent and multi-agent systems, today is perfect time to start.

Google released a new Agent framework: ADK. Maybe the most efficient way to code agents with tools, like internet search, MCP and so much more.

youtu.be/Geo8LzCHoMQ?...
I Learn How to Code Agents w/ Google's NEW ADK
YouTube video by Discover AI
youtu.be
April 10, 2025 at 2:28 PM
META AI is in trouble.

The REAL Llama 4 models fail to perform as marketed.

"Optimized" Llama 4 for Benchmark Tests only.

#llama4

youtube.com/post/UgkxyO3...
Post from Discover AI - YouTube
Hi community, "Meta got caught gaming AI benchmarks" is a title from The Verge, that reported, the Llama 4 model on LMarena is not the model we as AI communi...
youtube.com
April 9, 2025 at 7:04 AM
New Llama 4 Maverick 400B model, official benchmark:

youtube.com/post/UgkxWRX...
Post from Discover AI - YouTube
Hi community, Llama 4 MAVERICK 400B ranks #29 on AIDER benchmark. Aider just published the new benchmark results for Llama 4 MAVERICK 400B (17B, 128) MoE. I ...
youtube.com
April 7, 2025 at 5:39 PM
I am really interested in the new AI research by DeepSeek.

They published a new Reward model for their next Reasoning model:

Explained in detail here

youtu.be/9KMxNZ2CvUg?...
NEW by DeepSeek: SPCT w/ DeepSeek-GRM-27B
YouTube video by Discover AI
youtu.be
April 7, 2025 at 5:36 PM
Llama 4 Maverick 400B MoE

Tested on logic and causal reasoning: press a sequence of 5 elevator buttons to go up.

Llama 4 400B failed ... 6 times .... and got frustrated with me!

Live recording of Llama 4 performance

youtu.be/8G-GI4bvWZU?...
TEST Llama 4 Maverick 400B: Enjoy the Silence
YouTube video by Discover AI
youtu.be
April 6, 2025 at 4:33 PM
First real world TEST

Llama 4 Maverick 400B

On causal reasoning
#llama4

New video

youtu.be/12lAM-xPvu8?...
Llama 4 Maverick 400B: First Real-World TEST
YouTube video by Discover AI
youtu.be
April 6, 2025 at 4:31 PM
New DeepSeek V3 0324 has an impressive performance, but is not a reasoning LLM.

So I manually build a Single Task R2 system - out of DeepSeek V3 0324.

I show you all the steps for a single Task R2 - if you want to upgrade already.

youtu.be/TxtSD8DDqKk?...
Single-Task R2 from DeepSeek V3 0324 - an Experiment
YouTube video by Discover AI
youtu.be
March 26, 2025 at 5:40 PM
New small DeepSeek R1 Models.

New 14B and 32B Light-R1 models for local use. Open- source.

youtu.be/FAg4v2xaLYc?...
Improved DeepSeek R1-32B & R1-14B: NEW Light-R1
YouTube video by Discover AI
youtu.be
March 17, 2025 at 4:26 PM
Combine your local LLM with a cloud based reasoning #LLM.

New protocol to run your Ollama DeepSeek #R1 locally and only use #Sonnet 3.7 for the heavy thinking.

#Stanford Univ open sourced a new code

youtu.be/L-WfRaSPE2A?...
EASY: Multi-LLM Protocol for Local & Cloud AI (Stanford)
YouTube video by Discover AI
youtu.be
February 27, 2025 at 6:57 PM
The old dream: Ai will do research for us.

Google's new Co-scientist with 7 interacting special AI agents is the latest try.

But be careful: to automate scientific research with AI can have massive side effects.

New video

youtu.be/TUo1VeeBgOU?...
Google Stanford AI Co-Scientist: The SCIENCE YOU want
YouTube video by Discover AI
youtu.be
February 25, 2025 at 4:29 PM
Maybe Grok 3 or Sonnet 3.7 is not a good choice for your AI agents.

I will show you a new optimization where given a task, multi LLMs will be selected in a multi agent config - according to their performance. Not their Hype.

New video:

youtu.be/7HxDU8K59k8?...
Apply LLMSelector to your AI Agents (Tutorial & Code)
YouTube video by Discover AI
youtu.be
February 25, 2025 at 4:25 PM
Stanford Univ innovated tool use for AI agents.

Since agents can integrate other specialized AI agents as tools, a new method for multi tool use was invented - without the need to train any Model or supervisor agent.

New video

youtu.be/4828sGfx7dk?...
Better than AutoGen & LangChain: OctoTools (Stanford AI)
YouTube video by Discover AI
youtu.be
February 25, 2025 at 4:21 PM
Grok 3 is now free for 10 queries per day

First tests w/ Grok 3 THINK (the deep reasoning mode) - similar to DeepSeek R1 reasoning

youtu.be/1trUPXnREmA?...
3 Logic TESTS of GROK 3 - THINK mode (hardest problems in math)
YouTube video by Discover AI
youtu.be
February 20, 2025 at 4:12 PM
Hi community,
Today Perplexity.AI gave us their Deep Research Engine. Free Deep Research!!!

Instead of OpenAI's $200 option or Google's advanced option, Perplexity offers 5 free runs of Deep Research per day.

I did live testing - regarding the latest AI research topics:

youtu.be/Z9IpO3TTskU?...
NEW: FREE "Deep Research" by Perplexity - LIVE TEST
YouTube video by Discover AI
youtu.be
February 15, 2025 at 12:15 PM
AI Clones of human individuals are rather easy to code.

Next level is to code an AI Agent with the personal values, individual characteristics and private thoughts of an individual.

Deep reasoning is the way to explore those - for an AI Agent representing YOU

New video

youtu.be/gnJqsO8Mm1w?...
AI Digital Clone Next Level: AI Represents YOU
YouTube video by Discover AI
youtu.be
February 14, 2025 at 7:53 PM
If you want to see a product presentation by OpenAI, that is really special, why not have a look at OpenAI's homepage for the new product: Deep Research.

I compare the performance of the new Deep Search to a human and to a vanilla ChatGPT (free version).

youtu.be/tLnZBUuxNAI?...
AI DISASTER: Product DEMO by OpenAI - Deep Research
YouTube video by Discover AI
youtu.be
February 7, 2025 at 4:47 PM
There is a new OPEN R1 initiative.

Yes, DeepSeek R1 is open- source, but some secrets still remain.

Open R1 is a new effort by the open-source community, to uncover the complete complexity of the latest AI.

More details and how you can interact:

youtu.be/2ENvGkkK36E?...
Last Secrets Uncovered: OPEN DeepSeek R1
YouTube video by Discover AI
youtu.be
January 29, 2025 at 1:08 PM
Improved AI reasoning with knowledge graphs and multi agent systems.

Improve on your Knowledge Graphs in GraphRAG.

The idea is simple: instead of planning your path node by node, calculate community to community.

Faster, cheaper and more efficient:

youtu.be/DoI4nWQuywI?...
SMARTER: AI Reasoning w Knowledge Graphs + Agents
YouTube video by Discover AI
youtu.be
January 28, 2025 at 7:43 PM
Fact checking is essential.
Especially for AI systems.

To reduce AI hallucinations, new research on AI internal fact checking has been published.

The ultimate AI Fact checking method? It comes from medical record fact checking!

This video explains it:

youtu.be/ry3R7k6x1Pg?...
ULTIMATE Fact Checking AI (John Hopkins, Stanford)
YouTube video by Discover AI
youtu.be
January 28, 2025 at 7:38 PM
Can we really learn from LLMs? Can they become our learning engines?

I am perform a live test: OpenAI on vs DeepSeek R1 on learning and explaining new AI research topics.

What do you think? Is it worth paying triple prices for o1?

Here a direct comparison of o1 and R1

youtu.be/HM92mmG6YTs?...
PAY MORE for Intelligence? DeepSeek R1 vs o1 LIVE TEST
YouTube video by Discover AI
youtu.be
January 23, 2025 at 2:31 PM
A performance comparison of new #Gemini Thinking 01-21 LLM (published today) and DeepSeek R1 (published yesterday). #R1 #Reasoning

If you want to see a frustrated LLM that went into deep #CoT on my reasoning task and declares: I GIVE UP! ...have a look at my new video:

youtu.be/jb6egub3JDk?...
ANGRY: NEW Gemini Thinking vs R1 DeepSeek (on Logic)
YouTube video by Discover AI
youtu.be
January 22, 2025 at 3:39 PM