No R2, but an improved R1.
I already tested it on complex reasoning, and this open source model is impressive.
youtu.be/toailYMTAKo?...
No R2, but an improved R1.
I already tested it on complex reasoning, and this open source model is impressive.
youtu.be/toailYMTAKo?...
No cursor, no windsurf, no bolt, no lovable, no replit.
Google has a new service: try it for free in AI Studio.
Code your app in 15 minutes, from scratch:
youtu.be/x7zrS6xXmgM?...
No cursor, no windsurf, no bolt, no lovable, no replit.
Google has a new service: try it for free in AI Studio.
Code your app in 15 minutes, from scratch:
youtu.be/x7zrS6xXmgM?...
I performed a cascading logic test on it, in thinking and non- thinking mode.
Live recording
youtu.be/u-WXyeV1tsw?...
I performed a cascading logic test on it, in thinking and non- thinking mode.
Live recording
youtu.be/u-WXyeV1tsw?...
New Quantum AI Framework has been published. AI comes closer to theoretical physics.
Further insights here
youtu.be/vIb3B0PIklE?...
New Quantum AI Framework has been published. AI comes closer to theoretical physics.
Further insights here
youtu.be/vIb3B0PIklE?...
Google released a new Agent framework: ADK. Maybe the most efficient way to code agents with tools, like internet search, MCP and so much more.
youtu.be/Geo8LzCHoMQ?...
Google released a new Agent framework: ADK. Maybe the most efficient way to code agents with tools, like internet search, MCP and so much more.
youtu.be/Geo8LzCHoMQ?...
The REAL Llama 4 models fail to perform as marketed.
"Optimized" Llama 4 for Benchmark Tests only.
#llama4
youtube.com/post/UgkxyO3...
The REAL Llama 4 models fail to perform as marketed.
"Optimized" Llama 4 for Benchmark Tests only.
#llama4
youtube.com/post/UgkxyO3...
They published a new Reward model for their next Reasoning model:
Explained in detail here
youtu.be/9KMxNZ2CvUg?...
They published a new Reward model for their next Reasoning model:
Explained in detail here
youtu.be/9KMxNZ2CvUg?...
Tested on logic and causal reasoning: press a sequence of 5 elevator buttons to go up.
Llama 4 400B failed ... 6 times .... and got frustrated with me!
Live recording of Llama 4 performance
youtu.be/8G-GI4bvWZU?...
Tested on logic and causal reasoning: press a sequence of 5 elevator buttons to go up.
Llama 4 400B failed ... 6 times .... and got frustrated with me!
Live recording of Llama 4 performance
youtu.be/8G-GI4bvWZU?...
Llama 4 Maverick 400B
On causal reasoning
#llama4
New video
youtu.be/12lAM-xPvu8?...
Llama 4 Maverick 400B
On causal reasoning
#llama4
New video
youtu.be/12lAM-xPvu8?...
So I manually build a Single Task R2 system - out of DeepSeek V3 0324.
I show you all the steps for a single Task R2 - if you want to upgrade already.
youtu.be/TxtSD8DDqKk?...
So I manually build a Single Task R2 system - out of DeepSeek V3 0324.
I show you all the steps for a single Task R2 - if you want to upgrade already.
youtu.be/TxtSD8DDqKk?...
New 14B and 32B Light-R1 models for local use. Open- source.
youtu.be/FAg4v2xaLYc?...
New 14B and 32B Light-R1 models for local use. Open- source.
youtu.be/FAg4v2xaLYc?...
New protocol to run your Ollama DeepSeek #R1 locally and only use #Sonnet 3.7 for the heavy thinking.
#Stanford Univ open sourced a new code
youtu.be/L-WfRaSPE2A?...
New protocol to run your Ollama DeepSeek #R1 locally and only use #Sonnet 3.7 for the heavy thinking.
#Stanford Univ open sourced a new code
youtu.be/L-WfRaSPE2A?...
Google's new Co-scientist with 7 interacting special AI agents is the latest try.
But be careful: to automate scientific research with AI can have massive side effects.
New video
youtu.be/TUo1VeeBgOU?...
Google's new Co-scientist with 7 interacting special AI agents is the latest try.
But be careful: to automate scientific research with AI can have massive side effects.
New video
youtu.be/TUo1VeeBgOU?...
I will show you a new optimization where given a task, multi LLMs will be selected in a multi agent config - according to their performance. Not their Hype.
New video:
youtu.be/7HxDU8K59k8?...
I will show you a new optimization where given a task, multi LLMs will be selected in a multi agent config - according to their performance. Not their Hype.
New video:
youtu.be/7HxDU8K59k8?...
Since agents can integrate other specialized AI agents as tools, a new method for multi tool use was invented - without the need to train any Model or supervisor agent.
New video
youtu.be/4828sGfx7dk?...
Since agents can integrate other specialized AI agents as tools, a new method for multi tool use was invented - without the need to train any Model or supervisor agent.
New video
youtu.be/4828sGfx7dk?...
First tests w/ Grok 3 THINK (the deep reasoning mode) - similar to DeepSeek R1 reasoning
youtu.be/1trUPXnREmA?...
First tests w/ Grok 3 THINK (the deep reasoning mode) - similar to DeepSeek R1 reasoning
youtu.be/1trUPXnREmA?...
Today Perplexity.AI gave us their Deep Research Engine. Free Deep Research!!!
Instead of OpenAI's $200 option or Google's advanced option, Perplexity offers 5 free runs of Deep Research per day.
I did live testing - regarding the latest AI research topics:
youtu.be/Z9IpO3TTskU?...
Today Perplexity.AI gave us their Deep Research Engine. Free Deep Research!!!
Instead of OpenAI's $200 option or Google's advanced option, Perplexity offers 5 free runs of Deep Research per day.
I did live testing - regarding the latest AI research topics:
youtu.be/Z9IpO3TTskU?...
Next level is to code an AI Agent with the personal values, individual characteristics and private thoughts of an individual.
Deep reasoning is the way to explore those - for an AI Agent representing YOU
New video
youtu.be/gnJqsO8Mm1w?...
Next level is to code an AI Agent with the personal values, individual characteristics and private thoughts of an individual.
Deep reasoning is the way to explore those - for an AI Agent representing YOU
New video
youtu.be/gnJqsO8Mm1w?...
I compare the performance of the new Deep Search to a human and to a vanilla ChatGPT (free version).
youtu.be/tLnZBUuxNAI?...
I compare the performance of the new Deep Search to a human and to a vanilla ChatGPT (free version).
youtu.be/tLnZBUuxNAI?...
Yes, DeepSeek R1 is open- source, but some secrets still remain.
Open R1 is a new effort by the open-source community, to uncover the complete complexity of the latest AI.
More details and how you can interact:
youtu.be/2ENvGkkK36E?...
Yes, DeepSeek R1 is open- source, but some secrets still remain.
Open R1 is a new effort by the open-source community, to uncover the complete complexity of the latest AI.
More details and how you can interact:
youtu.be/2ENvGkkK36E?...
Improve on your Knowledge Graphs in GraphRAG.
The idea is simple: instead of planning your path node by node, calculate community to community.
Faster, cheaper and more efficient:
youtu.be/DoI4nWQuywI?...
Improve on your Knowledge Graphs in GraphRAG.
The idea is simple: instead of planning your path node by node, calculate community to community.
Faster, cheaper and more efficient:
youtu.be/DoI4nWQuywI?...
Especially for AI systems.
To reduce AI hallucinations, new research on AI internal fact checking has been published.
The ultimate AI Fact checking method? It comes from medical record fact checking!
This video explains it:
youtu.be/ry3R7k6x1Pg?...
Especially for AI systems.
To reduce AI hallucinations, new research on AI internal fact checking has been published.
The ultimate AI Fact checking method? It comes from medical record fact checking!
This video explains it:
youtu.be/ry3R7k6x1Pg?...
I am perform a live test: OpenAI on vs DeepSeek R1 on learning and explaining new AI research topics.
What do you think? Is it worth paying triple prices for o1?
Here a direct comparison of o1 and R1
youtu.be/HM92mmG6YTs?...
I am perform a live test: OpenAI on vs DeepSeek R1 on learning and explaining new AI research topics.
What do you think? Is it worth paying triple prices for o1?
Here a direct comparison of o1 and R1
youtu.be/HM92mmG6YTs?...
If you want to see a frustrated LLM that went into deep #CoT on my reasoning task and declares: I GIVE UP! ...have a look at my new video:
youtu.be/jb6egub3JDk?...
If you want to see a frustrated LLM that went into deep #CoT on my reasoning task and declares: I GIVE UP! ...have a look at my new video:
youtu.be/jb6egub3JDk?...