{"id":19764,"date":"2026-04-29T21:30:24","date_gmt":"2026-04-29T21:30:24","guid":{"rendered":"https:\/\/jobuzo.com\/en\/the-whale-can-now-see-deepseek-adds-ai-vision-in-major-move\/"},"modified":"2026-04-29T21:30:24","modified_gmt":"2026-04-29T21:30:24","slug":"the-whale-can-now-see-deepseek-adds-ai-vision-in-major-move","status":"publish","type":"post","link":"https:\/\/jobuzo.com\/en\/the-whale-can-now-see-deepseek-adds-ai-vision-in-major-move\/","title":{"rendered":"\u2018The whale can now see\u2019: DeepSeek adds AI vision in major move"},"content":{"rendered":"<div>\n<div><img decoding=\"async\" src=\"https:\/\/cdn.i-scmp.com\/sites\/default\/files\/styles\/og_image_scmp_generic\/public\/d8\/images\/canvas\/2026\/04\/29\/f289c410-b32c-4d0f-913d-eae93a382379_f633b540.jpg?itok=Smh1i1mS&amp;v=1777472630\" class=\"ff-og-image-inserted\"><\/div>\n<div datatype=\"p\" data-qa=\"Component-Component\" class=\"e8zc9q40 css-1xdhyk6 ec74h0k0\" readability=\"10.666666666667\">Chinese artificial intelligence start-up <span data-qa=\"Component-Text\" class=\"css-0 ef9u0v00\">DeepSeek<\/span> has added multimodal capabilities to its flagship chatbot for the first time &ndash; meaning that it can process images and video in addition to text &ndash; bringing it in line with rivals that already offer the function.<\/div>\n<p datatype=\"p\" data-qa=\"Component-Component\" class=\"e8zc9q40 css-1c6uqr6 ec74h0k1\">The limited release to select users comes just days after the Hangzhou-based company released its new flagship model V4, which was followed by extensive price cuts.<\/p>\n<p datatype=\"p\" data-qa=\"Component-Component\" class=\"e8zc9q40 css-1c6uqr6 ec74h0k1\">According to DeepSeek multimodal team leader Chen Xiaokang, who made the announcement on Wednesday on social media, the function was initially offered to select users on DeepSeek&rsquo;s chatbot website and mobile application for beta testing.<\/p>\n<div data-qa=\"InlineAdSlot-Container\" class=\"css-zl1inp 
\n<div class=\"e11v3ui10">
e11v3ui14\">\n<div class=\"e11v3ui10 e11v3ui13 css-1umbx1w e1flwkbl0\" data-qa=\"AdSlot-Container\">\n<p>Advertisement<\/p>\n<\/div>\n<\/div>\n<div datatype=\"p\" data-qa=\"Component-Component\" class=\"e8zc9q40 css-1xdhyk6 ec74h0k0\" readability=\"10.591304347826\">&ldquo;Come try out the incredible work from our genius multimodal colleagues!&rdquo; <span data-qa=\"Component-Text\" class=\"css-0 ef9u0v00\">senior researcher Chen Deli<\/span> wrote on social media shortly after, adding that &ldquo;the little whale can now see&rdquo;, a reference to DeepSeek&rsquo;s whale logo.<\/div>\n<p datatype=\"p\" data-qa=\"Component-Component\" class=\"e8zc9q40 css-1c6uqr6 ec74h0k1\">On DeepSeek&rsquo;s chat interface, a new &ldquo;image recognition mode&rdquo; had been added alongside the &ldquo;expert&rdquo; and &ldquo;flash&rdquo; chat modes, which were introduced earlier this month.<\/p>\n<p datatype=\"p\" data-qa=\"Component-Component\" class=\"e8zc9q40 css-1c6uqr6 ec74h0k1\">As AI continues to rapidly progress, multimodal capabilities are viewed as a necessity to move beyond simple text conversations with users into more complex and economically valuable domains.<\/p>\n<p datatype=\"p\" data-qa=\"Component-Component\" class=\"e8zc9q40 css-1c6uqr6 ec74h0k1\">While DeepSeek&rsquo;s breakout moment in January 2025 made it a household name internationally due to its model&rsquo;s powerful reasoning 
capabilities and cost-efficiency, the start-up&rsquo;s lack of a multimodal offering since then has been seen as an Achilles&rsquo; heel.<\/p>\n<\/div>\n<p><sub>&lsquo;The whale can now see&rsquo;: DeepSeek adds AI vision in major move<\/sub><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Chinese artificial intelligence start-up DeepSeek has added multimodal capabilities to its flagship chatbot for the first time &ndash; meaning that it can process images and video in addition to text &ndash; bringing it in line with rivals that already offer the function. The limited release to select users comes just days after the Hangzhou-based company&#8230;<\/p>\n<p class=\"more-link-wrap\"><a href=\"https:\/\/jobuzo.com\/en\/the-whale-can-now-see-deepseek-adds-ai-vision-in-major-move\/\" class=\"more-link\">Read More<span class=\"screen-reader-text\"> &ldquo;\u2018The whale can now see\u2019: DeepSeek adds AI vision in major move&rdquo;<\/span> &raquo;<\/a><\/p>\n","protected":false},"author":1,"featured_media":19765,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-19764","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-news"],"_links":{"self":[{"href":"https:\/\/jobuzo.com\/en\/wp-json\/wp\/v2\/posts\/19764","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/jobuzo.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/jobuzo.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/jobuzo.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/jobuzo.com\/en\/wp-json\/wp\/v2\/comments?post=19764"}],"version-history":[{"count":0,"href":"https:\/\/jobuzo.com\/en\/wp-json\/wp\/v2\/posts\/19764\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/jobuzo.com\/en\/wp-json\/wp\/v2\/media\/19765"}],"wp:attachment"
:[{"href":"https:\/\/jobuzo.com\/en\/wp-json\/wp\/v2\/media?parent=19764"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/jobuzo.com\/en\/wp-json\/wp\/v2\/categories?post=19764"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/jobuzo.com\/en\/wp-json\/wp\/v2\/tags?post=19764"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}