Apple Silicon ML, without Python.

Pure-Go bindings to Apple's MLX with 24+ model architectures, training and inference paths, and single static binaries. Linux CUDA is supported for selected workflows.

Pre-1.0. Source private today, available for review on request. Used in production by skiff for local inference.

What it does

mlx-go is the Go-native compute foundation for on-device inference on Apple Silicon. Built on apple's Metal bindings, profiled with gputrace, and used by skiff for local inference. The product claim is simple: ship MLX-backed inference from Go without pulling a Python runtime into the deployment.