Louis Maddox
@permutans.bsky.social
Combinatorially curious https://spin.systems
Pinned
My new library and CLI for precise file patching is up!

textum docs.rs/textum/lates...

You give a target to delete/replace/insert at, and whether to include/exclude/extend the match boundary

Target can be specified as:
💬 String 🧩 Regex 📏 Line/Char/Byte # 📐 Position (row, col)

🔮 tree-sitter AST 🔜
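The idea behind boundary-aware patching can be sketched like this — note this is a from-scratch illustration of the concept, NOT textum's actual API (see docs.rs for the real interface); the `patch` function and `boundary` parameter here are hypothetical:

```python
# Illustrative sketch of boundary-aware patching (NOT textum's real API):
# find a string target, then either replace the match itself ("include")
# or leave it intact and insert just after it ("exclude").

def patch(text: str, target: str, replacement: str, boundary: str = "include") -> str:
    """Apply `replacement` at the first occurrence of `target` in `text`."""
    start = text.index(target)          # raises ValueError if the target is absent
    end = start + len(target)
    if boundary == "include":
        return text[:start] + replacement + text[end:]   # swap the match out
    if boundary == "exclude":
        return text[:end] + replacement + text[end:]     # keep match, insert after it
    raise ValueError(f"unknown boundary mode: {boundary}")

before = "fn main() { todo!() }"
print(patch(before, "todo!()", 'println!("hi")'))
print(patch(before, "fn main()", " /* patched */", boundary="exclude"))
```

The same shape extends naturally to the other target kinds in the post (regex, line/char/byte offsets, (row, col) positions): each just resolves to a `(start, end)` span before the boundary mode is applied.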
Going to upload my first m*del to H*ggingF*ce [dies of embarrassment]
December 17, 2025 at 12:17 AM
Ohhh TensorRT doesn't like (some types of) quant models...
December 16, 2025 at 11:03 PM
😇
December 16, 2025 at 10:12 PM
hm, Windows binaries are incompressible by UPX due to a "Control Flow Guard"...
December 16, 2025 at 8:30 PM
These new macOS 14 GHA runners have agoraphobia ._.
December 16, 2025 at 6:19 PM
Reposted by Louis Maddox
the audacity of github to charge me to use my own self-hosted runners

resources.github.com/actions/2026...
Pricing changes for GitHub Actions
GitHub Actions pricing update: Discover lower runner rates (up to 39% off) following a major re-architecture for faster, more reliable CI/CD.
December 16, 2025 at 6:07 PM
ONNX runtime CUDA dynlibs shrunk -77% with upx + still work 🥳
December 16, 2025 at 5:32 PM
Reposted by Louis Maddox
December 16, 2025 at 1:26 PM
New record: 3 Python packages published from one repo across separate workflows, 1 mixed Python/Rust via maturin-action, 2 regular pure Python pypa/gh-action-pypi-publish (all via Trusted Publishing) 🚀🚀🚀
December 16, 2025 at 3:43 PM
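For one of the pure-Python packages, the Trusted Publishing setup is roughly this shape — a hypothetical minimal workflow (names and triggers are placeholders), using the real `pypa/gh-action-pypi-publish` action with OIDC instead of an API token:

```yaml
# Hypothetical minimal publish workflow for a pure-Python package
# via Trusted Publishing (OIDC) — no PYPI_API_TOKEN secret needed.
name: publish
on:
  release:
    types: [published]
jobs:
  pypi:
    runs-on: ubuntu-latest
    permissions:
      id-token: write   # required for Trusted Publishing
    steps:
      - uses: actions/checkout@v4
      - run: pipx run build
      - uses: pypa/gh-action-pypi-publish@release/v1
```

The mixed Python/Rust package swaps the build step for `PyO3/maturin-action`, but the `id-token: write` permission is the same.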
🗞️ “US pauses implementation of $40 billion technology deal with Britain” www.reuters.com/world/europe...
The United States has paused a $40 billion technology agreement with Britain, officials said, following concerns in Washington over London's approach to digital regulation and food standards.
December 16, 2025 at 12:22 PM
Decided to rename ‘release paraphernalia’ to ‘mech’ because my brain was not caffeinated enough to type all that into a commit message
December 16, 2025 at 11:04 AM
I guess we storing JSON in NPZ now
December 16, 2025 at 1:42 AM
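Stashing JSON inside an `.npz` works because NumPy will happily store a 0-d string array, so metadata can ride along with the tensors — a minimal sketch (the `meta` payload here is a made-up example):

```python
# Store a JSON blob alongside arrays in an .npz archive: a 0-d unicode
# array needs no pickling, so it round-trips through savez/load cleanly.
import io, json
import numpy as np

meta = {"model": "example", "dim": 192}          # hypothetical metadata
buf = io.BytesIO()
np.savez(buf, weights=np.zeros(4), meta=np.array(json.dumps(meta)))

buf.seek(0)
archive = np.load(buf)
restored = json.loads(str(archive["meta"]))      # back to a dict
print(restored["dim"])
```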
v cool: Luxical from datalogyai www.datologyai.com/blog/introdu...
Snowflake Arctic (m-v2.0) as teacher model, 192D embeddings, vocab of 5-grams from FineWeb, BERT uncased tokeniser, custom CPU kernel in numba github.com/datologyai/l...

Buried lede: arrow-tokenize in Rust github.com/datologyai/l...
luxical/src/luxical/csr_matrix_utils.py at e40f6bb3bdcca7776740a0544009c5bb83eef6e3 · datologyai/luxical
December 16, 2025 at 1:03 AM
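The bag-of-5-grams featurisation can be sketched with the hashing trick — to be clear, this is a toy version of the computation's shape, not Luxical's implementation (they use a learned 5-gram vocabulary mined from FineWeb, a BERT uncased tokeniser, and a custom numba CPU kernel):

```python
# Toy character-5-gram featuriser using the hashing trick: each n-gram
# is hashed into one of `buckets` slots and counted. Note Python's hash()
# is salted per process (PYTHONHASHSEED), so slots are only stable
# within a run — a real system would use a fixed hash.
from collections import Counter

def ngram_counts(text: str, n: int = 5, buckets: int = 1 << 12) -> Counter:
    """Count character n-grams of `text`, hashed into `buckets` slots."""
    counts = Counter()
    for i in range(len(text) - n + 1):
        gram = text[i : i + n]
        counts[hash(gram) % buckets] += 1
    return counts

vec = ngram_counts("optimal transport on embeddings")
print(sum(vec.values()))  # one count per 5-gram window
```

The resulting sparse counts are exactly what a CSR matrix holds, which is presumably where the repo's `csr_matrix_utils.py` comes in.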
Optimal transport planning on embeddings (NeurIPS 2024) proceedings.neurips.cc/paper_files/...
FASTopic github.com/bobxwu/fasto...
December 16, 2025 at 12:19 AM
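The core primitive — an entropy-regularised optimal transport plan between two histograms — fits in a few lines of Sinkhorn iteration. A minimal sketch of that primitive only, not FASTopic's actual code (its transport is between document and topic embeddings):

```python
# Minimal Sinkhorn iteration: alternately rescale rows and columns of a
# Gibbs kernel until the transport plan's marginals match a and b.
import numpy as np

def sinkhorn(a, b, cost, eps=0.05, iters=200):
    """Entropy-regularised OT plan between histograms a and b."""
    K = np.exp(-cost / eps)            # Gibbs kernel from the cost matrix
    u = np.ones_like(a)
    for _ in range(iters):
        v = b / (K.T @ u)              # enforce column marginal b
        u = a / (K @ v)                # enforce row marginal a
    return u[:, None] * K * v[None, :]

a = np.array([0.5, 0.5])
b = np.array([0.25, 0.75])
cost = np.array([[0.0, 1.0], [1.0, 0.0]])
plan = sinkhorn(a, b, cost)
print(plan.sum(axis=1))  # row marginals recover a
```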
“we observe that TensorRT always outperforms CUDA”
December 16, 2025 at 12:06 AM
30+ min ONNX builds down to <2m ✧⁠\⁠(⁠>⁠o⁠<⁠)⁠ノ⁠✧
December 15, 2025 at 10:44 PM
Reposted by Louis Maddox
uh oh: "By operating directly over raw UTF-8 bytes..."
December 15, 2025 at 5:22 PM
hmm TensorRT is a no for embeddings apparently
December 15, 2025 at 4:41 PM
📝 TensorRT 10.x installation notes on Ubuntu 24.04 github.com/lmmx/devnote...
Installing TensorRT 10.x on Ubuntu 24.04
obscure technical resolutions re: errors, installation quirks, custom setups etc. - lmmx/devnotes
December 15, 2025 at 3:57 PM
“Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali” github.com/michaelfeil/...
December 15, 2025 at 3:19 PM
Interesting, ONNX runtime has separate providers for TensorRT & “TensorRT RTX” onnxruntime.ai/docs/executi...
Launched 6 months ago github.com/NVIDIA/Tenso...
NVIDIA - TensorRT RTX
Instructions to execute ONNX Runtime on NVIDIA RTX GPUs with the NVIDIA TensorRT RTX execution provider
December 15, 2025 at 2:38 PM
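In practice you hand ONNX Runtime a preference-ordered provider list and let it fall back. A small sketch of that selection logic — the TensorRT-RTX provider name below is taken on trust from the docs page and should be treated as an assumption; check `ort.get_available_providers()` on your build:

```python
# Choose ONNX Runtime execution providers in preference order, dropping
# any this build doesn't ship. Provider name strings for the newer
# TensorRT-RTX EP are an assumption — verify against your install.
PREFERRED = [
    "NvTensorRtRtxExecutionProvider",   # TensorRT for RTX (assumed name)
    "TensorrtExecutionProvider",        # classic TensorRT EP
    "CUDAExecutionProvider",
    "CPUExecutionProvider",             # always present as a fallback
]

def pick_providers(available: list[str]) -> list[str]:
    """Keep the preference order, filtered to what is actually available."""
    return [p for p in PREFERRED if p in available]

# With onnxruntime installed, usage would look like:
#   import onnxruntime as ort
#   sess = ort.InferenceSession(
#       "model.onnx",
#       providers=pick_providers(ort.get_available_providers()))
print(pick_providers(["CUDAExecutionProvider", "CPUExecutionProvider"]))
```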
💬 do I look like I know what a dequant path is
December 15, 2025 at 2:18 AM
TIL you can ship ONNX runtime CUDA support as a dependency via a 300MB wheel 🫠 pypi.org/project/onnx...
December 15, 2025 at 12:46 AM
Growth is thinking “can we do any tricks with Python packaging and maturin features?” and not acting on them
December 15, 2025 at 12:11 AM