Running DeepSeek Model on Qualcomm Hexagon Processor
Learn how to run DeepSeek LLM locally on Snapdragon X Elite using ONNX Runtime and Qualcomm Hexagon NPU for blazing-fast AI performance. This step-by-step guide covers: Setting up Windows on Snapdragon Python environment Installing ONNX Runtime with Hexagon execution provider Deploying DeepSeek R1 (7B) model with tokenizer Optimizing inference with burst mode, low power mode, and caching Implementing greedy sampling, temperature, top-k, and repetition penalty Running chatbot in Jupyter Notebook and command-line app Boost data privacy, security, and efficiency by owning your LLM and running it on-device. This video is ideal for AI developers, edge computing enthusiasts, and anyone building local AI solutions. If you have any questions or need further assistance, feel free to leave a comment below. Subscribe: More Qualcomm Developer videos: http://tinyurl.com/2p8xmcw6 Join our community of developers on Discord to stay up-to-date, find expert support, and access exclusive virtual events: https://discord.gg/THUPBtskgs
Download
0 formatsNo download links available.