Ostris
@ostris.com
It kind of sucks that the AI/ML community seems to exclusively use twitter, at least people interested in the type of work I do.
April 23, 2025 at 6:51 PM
It kind of sucks that the AI/ML community seems to exclusively use twitter, at least people interested in the type of work I do.
Flex.2-preview is here with text to image, universal control (line, pose, depth), and inpainting all baked into one model. Fine tunable with AI-Toolkit, Apache 2.0 license, 8B parameters. huggingface.co/ostris/Flex....
April 22, 2025 at 9:54 PM
Flex.2-preview is here with text to image, universal control (line, pose, depth), and inpainting all baked into one model. Fine tunable with AI-Toolkit, Apache 2.0 license, 8B parameters. huggingface.co/ostris/Flex....
HiDream LoRA fine tuning is now live on AI-Toolkit CLI and in the GUI. It currently requires a minimum of 36 GB of VRAM. Working on getting that down.
github.com/ostris/ai-to...
github.com/ostris/ai-to...
Add Hidream support by jaretburkett · Pull Request #278 · ostris/ai-toolkit
Currently requires >37GB of vram. Working on getting that to 24, but no promises
github.com
April 16, 2025 at 8:17 PM
HiDream LoRA fine tuning is now live on AI-Toolkit CLI and in the GUI. It currently requires a minimum of 36 GB of VRAM. Working on getting that down.
github.com/ostris/ai-to...
github.com/ostris/ai-to...
Flex Redux 512 was just released. SigLIP2 512 Vision Encoder. Works with Flex.1-alpha and FLUX.1-dev. Apache2.0 license. huggingface.co/ostris/Flex....
April 4, 2025 at 2:26 PM
Flex Redux 512 was just released. SigLIP2 512 Vision Encoder. Works with Flex.1-alpha and FLUX.1-dev. Apache2.0 license. huggingface.co/ostris/Flex....
AI generated nonsense music video with a LoRA I trained of myself (Wan2.1 14B). Prompts for video, video, and music is all AI generated. I edited it myself, that is the last step to automate for a fully automated AI slop machine.
youtu.be/18SNWqdJt44
youtu.be/18SNWqdJt44
AI Generated Nonsense Music Video Starring Me
YouTube video by Ostris AI
youtu.be
March 19, 2025 at 3:14 AM
AI generated nonsense music video with a LoRA I trained of myself (Wan2.1 14B). Prompts for video, video, and music is all AI generated. I edited it myself, that is the last step to automate for a fully automated AI slop machine.
youtu.be/18SNWqdJt44
youtu.be/18SNWqdJt44
Tutorial on how to train with targeted flow guidance with AI Toolkit youtu.be/OVhusDyWoZ4
Training With Targeted Flow - Tutorial - AI Toolkit
YouTube video by Ostris AI
youtu.be
March 17, 2025 at 10:52 PM
Tutorial on how to train with targeted flow guidance with AI Toolkit youtu.be/OVhusDyWoZ4
Made some long overdue ComfyUI nodes for Flex.1-alpha.
A node to set guidance or bypass it for true CFG.
LoRA loaders that automatically prune Flux LoRAs to work with Flex. They won't work perfect, but it should be decent for most use cases.
github.com/ostris/Comfy...
A node to set guidance or bypass it for true CFG.
LoRA loaders that automatically prune Flux LoRAs to work with Flex. They won't work perfect, but it should be decent for most use cases.
github.com/ostris/Comfy...
GitHub - ostris/ComfyUI-FlexTools: Comfy UI nodes for Flex.1
Comfy UI nodes for Flex.1. Contribute to ostris/ComfyUI-FlexTools development by creating an account on GitHub.
github.com
March 14, 2025 at 6:22 PM
Made some long overdue ComfyUI nodes for Flex.1-alpha.
A node to set guidance or bypass it for true CFG.
LoRA loaders that automatically prune Flux LoRAs to work with Flex. They won't work perfect, but it should be decent for most use cases.
github.com/ostris/Comfy...
A node to set guidance or bypass it for true CFG.
LoRA loaders that automatically prune Flux LoRAs to work with Flex. They won't work perfect, but it should be decent for most use cases.
github.com/ostris/Comfy...
Wan 2.1 14B is amazing quality, but it is slow. The 1.3B version is extremely fast, and finetunes well. I trained a quick LoRA on it of myself for 1k steps. This is the most fun I have had messing with generative AI since the early SD1 days. Infinite personalized slop machine.
March 8, 2025 at 11:22 PM
Wan 2.1 14B is amazing quality, but it is slow. The 1.3B version is extremely fast, and finetunes well. I trained a quick LoRA on it of myself for 1k steps. This is the most fun I have had messing with generative AI since the early SD1 days. Infinite personalized slop machine.
First training sample montage of training a LoRA on Wan2.1 1.3B with AI Toolkit. Cruella.
Still have to test my LoRA format to see if I can get it to load anywhere or if I need to modify it. Initial release will likely only support training on stills for now.
Still have to test my LoRA format to see if I can get it to load anywhere or if I need to modify it. Initial release will likely only support training on stills for now.
March 7, 2025 at 8:57 PM
First training sample montage of training a LoRA on Wan2.1 1.3B with AI Toolkit. Cruella.
Still have to test my LoRA format to see if I can get it to load anywhere or if I need to modify it. Initial release will likely only support training on stills for now.
Still have to test my LoRA format to see if I can get it to load anywhere or if I need to modify it. Initial release will likely only support training on stills for now.
Testing out the current training version of Flex.1-alpha/Flux.1-dev Redux adapter with SigLIP2 so400m 512. My Patreon supporters can download and use the current training version now. Public release coming soon when it is done cooking.
youtu.be/J7zk9sURLcM
patreon.com/posts/123794...
youtu.be/J7zk9sURLcM
patreon.com/posts/123794...
Flex.1-alpha Redux Adapter Progress Update 1 and Drop
YouTube video by Ostris AI
youtu.be
March 6, 2025 at 8:39 PM
Testing out the current training version of Flex.1-alpha/Flux.1-dev Redux adapter with SigLIP2 so400m 512. My Patreon supporters can download and use the current training version now. Public release coming soon when it is done cooking.
youtu.be/J7zk9sURLcM
patreon.com/posts/123794...
youtu.be/J7zk9sURLcM
patreon.com/posts/123794...
Running a training test for training a redux adapter for Flex.1-alpha using siglip2-so400m-patch16-512. It is learning it remarkably fast. The 512 resolution should help with detail and texture vs the 384 v1 version.
March 4, 2025 at 3:22 AM
Running a training test for training a redux adapter for Flex.1-alpha using siglip2-so400m-patch16-512. It is learning it remarkably fast. The 512 resolution should help with detail and texture vs the 384 v1 version.
Flex.1-alpha face adapter training status update and current state demo. Still a long, long, looong way to go.
youtu.be/7WmuH2_KuOc?...
youtu.be/7WmuH2_KuOc?...
Flex Face Adapter Update 1 & Drop
YouTube video by Ostris AI
youtu.be
February 28, 2025 at 11:56 PM
Flex.1-alpha face adapter training status update and current state demo. Still a long, long, looong way to go.
youtu.be/7WmuH2_KuOc?...
youtu.be/7WmuH2_KuOc?...
5 days later and there was a UI. Only basic LoRA training for now. More features and tutorials coming soon.
February 24, 2025 at 12:07 AM
5 days later and there was a UI. Only basic LoRA training for now. More features and tutorials coming soon.
When a image/video gen model/method has a paper with no weights and no code. One can only assume that all the images/videos shown are heavily cherry picked.
February 4, 2025 at 8:43 PM
When a image/video gen model/method has a paper with no weights and no code. One can only assume that all the images/videos shown are heavily cherry picked.
There is no joke funnier than OpenAI clutching their pearls because they suspect someone may have used their data to train a LLM without their permission.
www.axios.com/2025/01/29/o...
www.axios.com/2025/01/29/o...
OpenAI says DeepSeek may have "inappropriately" used its models' output
The upstart Chinese AI maker might have "distilled" OpenAI's models, violating terms of service, OpenAI said.
www.axios.com
January 29, 2025 at 7:04 PM
There is no joke funnier than OpenAI clutching their pearls because they suspect someone may have used their data to train a LLM without their permission.
www.axios.com/2025/01/29/o...
www.axios.com/2025/01/29/o...
Reposted by Ostris
Introducing Kokoro.js, a new JavaScript library for running Kokoro TTS, an 82 million parameter text-to-speech model, 100% locally in the browser w/ WASM. Powered by 🤗 Transformers.js. WebGPU support coming soon!
👉 npm i kokoro-js 👈
Link to demo (+ sample code) in 🧵
👉 npm i kokoro-js 👈
Link to demo (+ sample code) in 🧵
January 16, 2025 at 3:05 PM
Introducing Kokoro.js, a new JavaScript library for running Kokoro TTS, an 82 million parameter text-to-speech model, 100% locally in the browser w/ WASM. Powered by 🤗 Transformers.js. WebGPU support coming soon!
👉 npm i kokoro-js 👈
Link to demo (+ sample code) in 🧵
👉 npm i kokoro-js 👈
Link to demo (+ sample code) in 🧵
Testing LoRA training for a new 8B model I have been cooking. Marty McFly and Pixar style LoRA training samples here.
It is based on a pruned version of OpenFlux that has been continuously trained. I also trained a guidance embedding for it among other cool things.
It is based on a pruned version of OpenFlux that has been continuously trained. I also trained a guidance embedding for it among other cool things.
December 19, 2024 at 2:07 AM
Testing LoRA training for a new 8B model I have been cooking. Marty McFly and Pixar style LoRA training samples here.
It is based on a pruned version of OpenFlux that has been continuously trained. I also trained a guidance embedding for it among other cool things.
It is based on a pruned version of OpenFlux that has been continuously trained. I also trained a guidance embedding for it among other cool things.
Has anyone had any luck converting FLUX LoRAs to SVDquant format? I have been trying to reverse engineer the process but keep hitting roadblocks.
December 5, 2024 at 6:11 PM
Has anyone had any luck converting FLUX LoRAs to SVDquant format? I have been trying to reverse engineer the process but keep hitting roadblocks.
Reposted by Ostris
A common question nowadays: Which is better, diffusion or flow matching? 🤔
Our answer: They’re two sides of the same coin. We wrote a blog post to show how diffusion models and Gaussian flow matching are equivalent. That’s great: It means you can use them interchangeably.
Our answer: They’re two sides of the same coin. We wrote a blog post to show how diffusion models and Gaussian flow matching are equivalent. That’s great: It means you can use them interchangeably.
December 2, 2024 at 6:45 PM
A common question nowadays: Which is better, diffusion or flow matching? 🤔
Our answer: They’re two sides of the same coin. We wrote a blog post to show how diffusion models and Gaussian flow matching are equivalent. That’s great: It means you can use them interchangeably.
Our answer: They’re two sides of the same coin. We wrote a blog post to show how diffusion models and Gaussian flow matching are equivalent. That’s great: It means you can use them interchangeably.
Seriously, why is my Amazon Echo still dumber than a box of rocks? Have any of the home assistants evolved past the technology from 10 years ago? Is someone going to do something about this or do I need to?
December 2, 2024 at 3:13 AM
Seriously, why is my Amazon Echo still dumber than a box of rocks? Have any of the home assistants evolved past the technology from 10 years ago? Is someone going to do something about this or do I need to?
Finally! Google Calendar has dark mode!
December 2, 2024 at 2:10 AM
Finally! Google Calendar has dark mode!
Testing training just an embedding that attaches like the Flux Redux output does. This is with 42 tokens doing cruella. It seems incapable of learning identity concatenating the embedding this way, leading me to think a face redux (which I am also training), may not be possible.
December 1, 2024 at 12:23 AM
Testing training just an embedding that attaches like the Flux Redux output does. This is with 42 tokens doing cruella. It seems incapable of learning identity concatenating the embedding this way, leading me to think a face redux (which I am also training), may not be possible.
You guys remember hyper networks? They just sort of disappeared when LoRA came along.
November 30, 2024 at 4:32 PM
You guys remember hyper networks? They just sort of disappeared when LoRA came along.
Why don’t people want AI models to be trained on their ideals and views. AI models WILL develop a bias based on their training data. I would prefer the biases of the people on this platform to be trained into them. Otherwise, future AI will just be a crypto bro. Please support scraping public data.
November 27, 2024 at 4:25 PM
Why don’t people want AI models to be trained on their ideals and views. AI models WILL develop a bias based on their training data. I would prefer the biases of the people on this platform to be trained into them. Otherwise, future AI will just be a crypto bro. Please support scraping public data.