🔥🤖📊 ARIA: The Open Multimodal AI Model Redefining Performance www.azoai.com/news/2024101... #AI #multimodal #machinelearning #opensource #textprocessing #imagemodeling #MoEarchitecture #dataintegration #longcontext #AIinnovation @arxiv-stat-ml.bsky.social
ARIA: The Open Multimodal AI Model Redefining Performance
ARIA is an open-source multimodal AI model that uses a mixture-of-experts architecture to achieve state-of-the-art performance across tasks involving text, code, images, and videos, excelling in long-...
www.azoai.com
October 16, 2024 at 12:16 AM
DeepEP: New high-performance MoE communication library with advanced GPU kernels and RDMA support
https://github.com/deepseek-ai/DeepEP
#gpucomputing #aiinfrastructure #networking #moearchitecture #performanceoptimization
February 25, 2025 at 5:34 AM
PyTorch profiling data reveals optimization strategies for MoE layers in training and inference
https://github.com/deepseek-ai/profile-data
#performanceprofiling #deeplearning #moearchitecture #pytorch #parallelcomputing
February 27, 2025 at 7:21 PM