IBM stated that Granite 4.0 models reduce RAM usage by more than 70% compared to transformer-based models in tasks involving ...
Built for long-context tasks and edge deployments, Granite 4.0 combines Mamba’s linear scaling with transformer precision, ...