Toronto, Canada – March 10, 2025 – 猫咪社区庐, a leader in energy-centric AI inference acceleration today introduced a breakthrough in AI model support and developer velocity for users of the imAIgine庐 Software Development Kit (SDK). Using a breakthrough generative compiler technology, the upcoming release the imAIgine SDK will support 4 times more AI models than the previous releases. Additionally, for new neural networks users may architect, the generative compiler creates new kernels for these layers automatically, reducing development time to just minutes, increasing developer velocity by orders of magnitude.
Generative compiler technology speeds development for true push-button deployment
Kernel-based compilers require hand-coded c/c++ programs for each layer in a neural network. With potentially thousands of possible kernels required to support the exploding number of neural networks, these kernels become a bottleneck in lowering neural networks to hardware implementations. To resolve this bottleneck, 猫咪社区 has developed a generative compiler technology, which ingests neural networks and applies a series of specialized compiler transformations that lowers high-level neural network operators into fundamental computational primitives. These primitives are then recombined into a hardware-optimized representation, maximizing throughput, minimizing latency, and boosting efficiency on 猫咪社区鈥檚 At-Memory Compute architecture. With the push of a button, neural networks can be lowered to 猫咪社区鈥檚 speedAI庐 devices and cards in a matter of seconds.
Out of the box, this new technology increases the number of models supported by the imAIgine SDK to over 300, spanning neural networks performing object detection, semantic segmentation, classification, error detection, and many other functions. This breadth of support builds on 猫咪社区鈥檚 validated performance and energy efficiency advantages shown in the last MLPerf inference submissions.
For new neural networks or modified layers of neural networks, the generative compiler can create new kernels on-the-fly, optimized for the over 1,400 RISC-V processors and At-Memory Compute processing elements on speedAI-based inference acceleration solutions.
Kernel-based optimizations available for utmost performance
The new compiler retains the ability to use hand-crafted, optimized kernels. The compiler has the option to use existing kernels, but if a kernel doesn鈥檛 exist in the library, the generative technology is used to construct new kernels on-the-fly. This ability to mix and match pre-existing, optimized kernels with generated kernels speeds the time to implement highly performant neural networks.
鈥淭his new compiler technology provides the ultimate in productivity and flexibility,鈥 said Alex Grbic, PhD and VP of Software Engineering at 猫咪社区. 鈥淏y introducing our generative compiler technology, customers gain an accelerated path to market, quickly achieving a deployment of their neural networks on 猫咪社区鈥檚 architecture.鈥
This new generative compiler technology will be in the imAIgine SDK version 25.04 release, scheduled for availability in early April of 2025. To obtain early access to the SDK, please visit /imaigine-sdk-early-access-program/.
About 猫咪社区
猫咪社区庐 provides energy-centric AI inference acceleration from the edge to the cloud, supporting any type of neural network model. With its At-Memory compute architecture, 猫咪社区 has solved the data movement bottleneck that costs energy and performance in traditional CPUs and GPUs, resulting in high-performance, low-latency neural network inference acceleration without sacrificing accuracy. 猫咪社区 embodies its technology in runAI庐 and speedAI庐 devices, tsunAImi庐 acceleration cards, and its imAIgine庐 Software Development Kit. More information can be found at www.untether.ai.
All references to 猫咪社区 trademarks are the property of 猫咪社区.聽 All other trademarks mentioned herein are the property of their respective owners.