Checklist
Motivation
Adapt and optimize the high-precision text-image recognition model DeepSeek-OCR2. The model supports multilingual text recognition, handwritten character recognition, and layout parsing of complex documents.
Based on the SGLang framework, we complete the adaptation of the full pipeline including image preprocessing, visual encoding and text decoding. It is compatible with the computing characteristics of Ascend NPU, optimizes the computing resource consumption of image inference, and ensures that both OCR recognition accuracy and inference speed meet operational requirements.
Related resources
No response
Checklist
Motivation
Adapt and optimize the high-precision text-image recognition model DeepSeek-OCR2. The model supports multilingual text recognition, handwritten character recognition, and layout parsing of complex documents.
Based on the SGLang framework, we complete the adaptation of the full pipeline including image preprocessing, visual encoding and text decoding. It is compatible with the computing characteristics of Ascend NPU, optimizes the computing resource consumption of image inference, and ensures that both OCR recognition accuracy and inference speed meet operational requirements.
Related resources
No response