SOPHON SDK is SOPHGO's proprietary SDK for the SOPHGO deep learning processor. Its tools let you deploy applications in the runtime environment and deliver maximum inference throughput and efficiency.
Two device driver modes are available, PCIE and SOC, which gives developers more choices.
Combined with the deep learning processor independently developed by SOPHGO, the SDK is designed for maximum inference throughput and a simple application deployment environment.
A runtime library programming interface is provided for manipulating the underlying computing resources, so users can carry out in-depth development.
The runtime library provides concurrent processing capabilities and supports multi-process and multi-thread modes.
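As a rough illustration of the multi-thread mode, the sketch below dispatches independent input batches to a pool of worker threads. It uses only the Python standard library; `run_inference` is a hypothetical stand-in for a runtime inference call and is not a SOPHON SDK function, and the worker count is an arbitrary assumption.

```python
from concurrent.futures import ThreadPoolExecutor


def run_inference(batch):
    # Hypothetical placeholder for a runtime inference call (not a SOPHON API).
    # Here it simply doubles every value so the example is runnable.
    return [x * 2 for x in batch]


def infer_concurrently(batches, workers=4):
    # Submit each batch to a worker thread; map() returns results
    # in the same order as the input batches.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(run_inference, batches))


if __name__ == "__main__":
    print(infer_concurrently([[1, 2], [3, 4], [5, 6]]))
```

A multi-process variant would follow the same pattern with `ProcessPoolExecutor`; which mode suits a given application depends on how the real runtime handles device resources.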
SOPHON SDK supports two kinds of compilation. For layers that the TPU supports, you can use MLIR to compile and deploy. For layers that the TPU does not currently support, you can extend the compiler through the MLIR programming interface and use the TPU Kernel programming interface or RISC-V instructions to add custom network layers, which enables users to compile non-public networks.
We provide developers with a Docker image for development that integrates the tools and libraries required by SOPHON SDK. Developers can use it to build deep learning applications.
After integration, the compiled network and the application can be deployed through the Runtime. During deployment, you can program against the Runtime inference engine API.