Hyperplane
banner
hyperplane.bsky.social
Hyperplane
@hyperplane.bsky.social
Your weekly read. From POC to Production, at scale.

🫵 Follow our substack: https://thehyperplane.substack.com/
👀 Our Ebook: https://hyperplane.gumroad.com/l/fine-tuning-stt-on-edge
Pinned
Finally made it! It’s been a long ride, but the first real STT guide for kids’ voices on edge devices is here. Check it out on Gumroad if you're curious!

the link for the book mlvanguards.gumroad.com/l/fine-tunin... (it's free)
Finally made it! It’s been a long ride, but the first real STT guide for kids’ voices on edge devices is here. Check it out on Gumroad if you're curious!

the link for the book mlvanguards.gumroad.com/l/fine-tunin... (it's free)
April 2, 2025 at 2:05 PM
Without observability, agents are just black boxes making guesses.
With the right tools, they become transparent, testable, improvable systems.

Agentic systems are only as useful as their debug-ability.
March 31, 2025 at 7:00 AM
MLOps is more than just tools.

Reproducibility, scalability, and observability are a must. Just setting up Kubeflow or MLflow doesn’t make you an MLOps expert.
March 28, 2025 at 11:48 AM
Raw audio is unpredictable.
Training is expensive.
Inference is a balancing act.
Deployment is… well, never as simple as pip install.

Everyone’s hyped about LLMs and edge deployments, but few talk about what it actually takes to it to production.

So we wrote the guide we wish we had.
March 27, 2025 at 2:08 PM
Real MLOps doesn’t happen in 1 day or 1 week. Knowing how to use different tools doesn’t make you an MLOps expert.

It takes time to understand the whole process, and it takes time to learn how to have arguments to convince the management, the CEO, or other stakeholders.
March 26, 2025 at 8:06 PM
🚨 Alert: This article is dangerously true!

mlvanguards.substack.com/p/the-malpra...
The malpractice of AI industry
The dangerous rise of false authority in AI, and how it’s quietly making us dumber.
mlvanguards.substack.com
March 26, 2025 at 2:44 PM
Reposted by Hyperplane
How does GraphRAG work? What are its advantages over other RAG?

Let's return to Michael Hunger's blog post in which he demonstrates its practical application using a hashtag#Neo4j example!

Happy weekend reading!

bit.ly/4f8wLVp

hashtag#knowledgegraphs
What Is GraphRAG? - Graph Database & Analytics
GraphRAG is a powerful retrieval mechanism that improves Generative AI applications by taking advantage of the rich context in graph data structures.
bit.ly
March 25, 2025 at 1:25 PM
In less than 24 hours, we turned 5K messy files (from @kaggle.com) into a clean dataset of ~3.9K audio+transcription pairs.

Here’s how:
- Resample & filter (standardize to 16kHz, cut long/empty clips)
- Auto-transcribe with SpeechBrain (ran it on CPU — I'm GPU poor 😅)

1/2
March 26, 2025 at 10:55 AM
In the last 2 years, prompt engineering has been treated as an afterthought, a means to an end.

But in reality, a prompt is the most crucial hyperparameter of any GenAI system. Its design can make or break the output quality, much like tuning a model's parameters determines its performance.
March 25, 2025 at 5:11 PM
We're launching an ebook on 28th this week 😳

Regular speech-to-text tech struggles with kids’ voices since they’re higher-pitched and less predictable.

We worked on that by creating a smaller, more accurate model that works well with children’s speech, even in noisy or low-power settings.
March 24, 2025 at 8:42 PM
The principles of an indexing pipeline:

1/7
March 24, 2025 at 12:18 PM
Not all PDFs are created equally. Some PDFs are beautifully structured with clean text, while others are chaotic with dense layouts, tables, or images.

Ignoring these differences means risking ineffective indexing and poor search retrieval.
March 24, 2025 at 8:45 AM
Reposted by Hyperplane
It's the evals, stupid
March 21, 2025 at 11:32 AM
AI can pretty much do anything, but it lacks that human creativity. Do you prefer quick tasks done or creativity?
#AI #Question
February 28, 2025 at 12:57 PM
"prompt engineering" is just fancy copy-pasting at this point

people tweaking prompts like they're adjusting a car mirror, thinking it'll make them drive better

you’re optimizing nothing, you’re just guessing
February 26, 2025 at 11:02 AM
Hot take: Meta’s working on a brain-reading tech that turns thoughts into text. No surgery, just a headset. It’s got typos (32% errors) but skilled typists did way better. It's wild to imagine this helping folks who can’t speak or type. Would you trust a computer with your thoughts?
February 21, 2025 at 5:03 PM
Trend keeps moving with all these new tools making it tough to keep track. The question is ChatGPT or Perplexity for better searching?
February 19, 2025 at 10:10 AM
Indexing messy PDFs? Try these:

LayoutAnalyzer for messy formatting.
TextDensityAnalyzer to tell text from images.
VisualElementAnalyzer for charts/diagrams.
TableDetector to grab tables.

Convert them to images and use Vision Models like ColPali/ColQwen2.
February 18, 2025 at 12:46 PM
Are you stuck in the whirlpool of AI trends?

Building production-grade AI isn’t about chasing buzzwords — it’s about combining engineering knowledge with practical AI to deliver systems that actually work, scale, and drive real-world impact.

We got you covered at mlvanguards.substack.com
ML Vanguards | Substack
Escaping PoC purgatory: Your Weekly Guide to production paradise. Click to read ML Vanguards, a Substack publication.
mlvanguards.substack.com
January 29, 2025 at 8:03 PM