open-mosaic/mosaic

Mosaic: Always-on GPU Collective Observability

Mosaic is an open-source, always-on observability tool for GPU collective communication. It provides near real-time visibility into performance and reliability issues in large-scale AI workloads. Mosaic treats collective communication as first-class OpenTelemetry data, so collective activity can be correlated with GPU, network, and system signals in a single view.

At GPU scale, where failures and inefficiencies are inevitable, Mosaic makes these issues visible early, shifting observability from postmortem analysis to continuous operations.

No offline tracing, no bespoke pipelines, and no invasive instrumentation.
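
To make the idea of collective communication as telemetry data more concrete, here is a minimal sketch using only the Python standard library. It is not Mosaic's implementation or schema: the `CollectiveSpan` class, the `slow_ranks` helper, and the attribute keys `rank` and `message_bytes` are all hypothetical names chosen for illustration. It assumes each collective call on each rank is recorded as a span-like record with a start time, an end time, and attributes, and shows how such records could be compared across ranks to spot a straggler.

```python
import statistics
from dataclasses import dataclass, field


@dataclass
class CollectiveSpan:
    """Hypothetical span-like record for one collective call on one rank."""
    name: str                 # collective type, e.g. "all_reduce"
    start_ns: int             # start timestamp in nanoseconds
    end_ns: int               # end timestamp in nanoseconds
    attributes: dict = field(default_factory=dict)  # illustrative keys only

    @property
    def duration_ms(self) -> float:
        return (self.end_ns - self.start_ns) / 1e6


def slow_ranks(spans, factor=2.0):
    """Return ranks whose duration exceeds `factor` times the median duration."""
    median = statistics.median(s.duration_ms for s in spans)
    return [s.attributes["rank"] for s in spans if s.duration_ms > factor * median]


# Four ranks taking part in the same all_reduce; rank 3 is a straggler.
durations_ms = [10, 11, 10, 50]
spans = [
    CollectiveSpan("all_reduce", 0, d * 1_000_000, {"rank": r, "message_bytes": 1 << 20})
    for r, d in enumerate(durations_ms)
]
print(slow_ranks(spans))  # [3]
```

Because every record carries timestamps and attributes in the same shape, the same kind of comparison could in principle be joined with GPU or network metrics over the same time window, which is the correlation the overview describes.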

Getting Started

To get Mosaic up and running in your environment, follow our Quick Start Guide.

Documentation

For a deep dive into core concepts, architecture, and advanced configuration, visit the Mosaic Documentation.
