{"id":9150,"date":"2025-10-21T03:19:50","date_gmt":"2025-10-21T03:19:50","guid":{"rendered":"https:\/\/jobuzo.com\/en\/deepseek-unveils-ai-model-that-uses-visual-perception-to-compress-text-input\/"},"modified":"2025-10-21T03:19:50","modified_gmt":"2025-10-21T03:19:50","slug":"deepseek-unveils-ai-model-that-uses-visual-perception-to-compress-text-input","status":"publish","type":"post","link":"https:\/\/jobuzo.com\/en\/deepseek-unveils-ai-model-that-uses-visual-perception-to-compress-text-input\/","title":{"rendered":"DeepSeek unveils AI model that uses visual perception to compress text input"},"content":{"rendered":"<div>\n<div datatype=\"p\" data-qa=\"Component-Component\" class=\"e8zc9q40 css-1xdhyk6 ec74h0k0\" readability=\"9.76\"><span data-qa=\"Component-Text\" class=\"css-0 ef9u0v00\">DeepSeek<\/span> on Monday released a new multimodal <span data-qa=\"Component-Text\" class=\"css-0 ef9u0v00\">artificial intelligence<\/span> model that can handle large and complex documents with significantly fewer tokens &ndash; the smallest unit of text that a model processes &ndash; by using visual perception as a compression medium for information.<\/div>\n<div data-qa=\"InlineAdSlot-Container\" class=\"css-zl1inp e11v3ui14\">\n<div class=\"e11v3ui10 e11v3ui13 css-1evd7i0 e1flwkbl0\" data-qa=\"AdSlot-Container\">\n<p>Advertisement<\/p>\n<\/div>\n<\/div>\n<div datatype=\"p\" data-qa=\"Component-Component\" class=\"e8zc9q40 css-1xdhyk6 ec74h0k0\" readability=\"14.658064516129\">The open-source DeepSeek-OCR (optical character recognition) model, available via online developer platforms <span data-qa=\"Component-Text\" class=\"css-0 ef9u0v00\">Hugging Face<\/span> and <span data-qa=\"Component-Text\" class=\"css-0 ef9u0v00\">GitHub<\/span>, was the result of an &ldquo;investigation into the role of vision encoders&rdquo; to compress text for large language models (LLMs), the <span data-qa=\"Component-Text\" class=\"css-0 ef9u0v00\">Hangzhou<\/span>-based AI start-up said in a blog post.<\/div>\n<p datatype=\"p\" data-qa=\"Component-Component\" class=\"e8zc9q40 css-1c6uqr6 ec74h0k1\">By using that approach, LLMs would be able to process a massive amount of text without incurring a proportional increase in computing cost.<\/p>\n<p datatype=\"p\" data-qa=\"Component-Component\" class=\"e8zc9q40 css-1c6uqr6 ec74h0k1\">&ldquo;Through DeepSeek-OCR, we demonstrated that vision-text compression can achieve significant token reduction &ndash; seven to 20 times &ndash; for different historical context stages, offering a promising direction&rdquo; to address long-context challenges in LLMs, the company said.<\/p>\n<div datatype=\"p\" data-qa=\"Component-Component\" class=\"e8zc9q40 css-1xdhyk6 ec74h0k0\" readability=\"12.829508196721\">That showed DeepSeek&rsquo;s steadfast efforts to raise the efficiency of AI models, while driving down the costs of building and using them &ndash; a principle that the company followed in the development of its breakthrough open-source models <span data-qa=\"Component-Text\" class=\"css-0 ef9u0v00\">V3<\/span> and <span data-qa=\"Component-Text\" class=\"css-0 ef9u0v00\">R1<\/span> that were released in December and February, respectively.<\/div>\n<div class=\"methode-html-wrapper oembed-wrapper e1a5rv550 css-1llrc1m e1yqhwb40\" data-qa=\"Component-renderMap-StyledDiv\">\n<div class=\"e1drg7e30 e1ciypty0 ehdmpxk0 css-qpd3t0 e1hpd36k0\" data-qa=\"Component-SCMPYoutubeVideoContainer\" readability=\"6\">\n<div data-qa=\"SCMPYoutubeVideoPreview-PreviewContainer\" class=\"css-zg359x e1o401186\">\n<div data-qa=\"SCMPYoutubeVideoPreview-PreviewContentContainer\" class=\"css-qgt9eo e1o401187\">\n<div data-qa=\"SCMPYoutubeVideoPreview-PreviewContentFloatContainer\" class=\"css-7nqkgi e1o401183\" readability=\"6\">\n<div data-qa=\"SCMPYoutubeVideoPreview-PreviewContentDataContainer\" class=\"css-1pcpmhl e1o401180\" readability=\"7\">\n<p data-qa=\"SCMPYoutubeVideoPreview-PreviewTitle\" class=\"css-zasw6y e1o401182\">[LIVE] China Future Tech webinar | How is DeepSeek shaping the race for AI supremacy?<\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<div class=\"e1hpd36k1 css-79hwxn e1x11wau0\" data-qa=\"SCMPYouTubeVideoFooter-VideoTitleContainer\" readability=\"7\"><svg width=\"24\" height=\"25\" viewbox=\"0 0 24 25\" fill=\"none\" data-qa=\"SCMPYouTubeVideoFooter-StyledVideoRecorder\" class=\"css-1nnwg93 e1x11wau1\"><path fill-rule=\"evenodd\" clip-rule=\"evenodd\" d=\"M22 6.22926V18.4315L17.5463 13.9868V17.8655C17.5463 18.1618 17.4383 18.4248 17.2228 18.6531C17.0067 18.8829 16.7498 18.9967 16.4537 18.9967H3.13414C2.80989 18.9967 2.53968 18.8829 2.32425 18.6531C2.10808 18.4248 2 18.1618 2 17.8655V6.79519C2 6.49893 2.10808 6.23665 2.32425 6.00688C2.53968 5.77858 2.80989 5.66333 3.13414 5.66333H16.4537C16.7498 5.66333 17.0067 5.77858 17.2228 6.00688C17.4383 6.23665 17.5463 6.49893 17.5463 6.79519V10.674L22 6.22926Z\" fill=\"#001246\"><\/path><\/svg>\n<p>[LIVE] China Future Tech webinar | How is DeepSeek shaping the race for AI supremacy?<\/p>\n<\/div>\n<\/div>\n<\/div>\n<p datatype=\"p\" data-qa=\"Component-Component\" class=\"e8zc9q40 css-1c6uqr6 ec74h0k1\">According to the company&rsquo;s blog post, DeepSeek-OCR consisted of two main components: DeepEncoder and DeepSeek3B-MoE-A570M as the decoder.<\/p>\n<div data-qa=\"InlineAdSlot-Container\" class=\"css-zl1inp e11v3ui14\">\n<div class=\"e11v3ui10 e11v3ui13 css-117a6hs e1flwkbl0\" data-qa=\"AdSlot-Container\">\n<div class=\"internal-linking-related-contents\"><a href=\"https:\/\/jobuzo.com\/en\/12-weeks-jail-for-school-it-support-technician-who-took-upskirt-videos-of-teachers\/\" class=\"template-1\"><span class=\"cta\">News :<\/span><span class=\"postTitle\">&lt;div&gt;12 weeks' jail for school IT support technician who took upskirt videos of teachers&lt;\/div&gt;<\/span><\/a><\/div><p>Advertisement<\/p>\n<\/div>\n<\/div>\n<\/div>\n<p><sub>DeepSeek unveils AI model that uses visual perception to compress text input<\/sub><\/p>\n","protected":false},"excerpt":{"rendered":"<p>DeepSeek on Monday released a new multimodal artificial intelligence model that can handle large and complex documents with significantly fewer tokens &ndash; the smallest unit of text that a model processes &ndash; by using visual perception as a compression medium for information. Advertisement The open-source DeepSeek-OCR (optical character recognition) model, available via online developer platforms&#8230;<\/p>\n<p class=\"more-link-wrap\"><a href=\"https:\/\/jobuzo.com\/en\/deepseek-unveils-ai-model-that-uses-visual-perception-to-compress-text-input\/\" class=\"more-link\">Read More<span class=\"screen-reader-text\"> &ldquo;DeepSeek unveils AI model that uses visual perception to compress text input&rdquo;<\/span> &raquo;<\/a><\/p>\n","protected":false},"author":1,"featured_media":9151,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-9150","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-news"],"_links":{"self":[{"href":"https:\/\/jobuzo.com\/en\/wp-json\/wp\/v2\/posts\/9150","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/jobuzo.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/jobuzo.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/jobuzo.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/jobuzo.com\/en\/wp-json\/wp\/v2\/comments?post=9150"}],"version-history":[{"count":0,"href":"https:\/\/jobuzo.com\/en\/wp-json\/wp\/v2\/posts\/9150\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/jobuzo.com\/en\/wp-json\/wp\/v2\/media\/9151"}],"wp:attachment":[{"href":"https:\/\/jobuzo.com\/en\/wp-json\/wp\/v2\/media?parent=9150"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/jobuzo.com\/en\/wp-json\/wp\/v2\/categories?post=9150"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/jobuzo.com\/en\/wp-json\/wp\/v2\/tags?post=9150"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}