{"id":233468,"date":"2025-10-22T21:49:13","date_gmt":"2025-10-22T21:49:13","guid":{"rendered":"https:\/\/www.newsbeep.com\/au\/233468\/"},"modified":"2025-10-22T21:49:13","modified_gmt":"2025-10-22T21:49:13","slug":"new-deepseek-model-drastically-reduces-resource-usage-by-converting-text-and-documents-into-images-vision-text-compression-uses-up-to-20-times-fewer-tokens","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/au\/233468\/","title":{"rendered":"New Deepseek model drastically reduces resource usage by converting text and documents into images \u2014 &#8216;vision-text compression&#8217; uses up to 20 times fewer tokens"},"content":{"rendered":"<p id=\"675db11a-a885-43d1-9e96-2f07829361ff\">Chinese developers of <a data-analytics-id=\"inline-link\" href=\"https:\/\/www.tomshardware.com\/tech-industry\/deepseek-new-model-supports-huawei-cann\" data-before-rewrite-localise=\"https:\/\/www.tomshardware.com\/tech-industry\/deepseek-new-model-supports-huawei-cann\" rel=\"nofollow noopener\" target=\"_blank\">Deepseek AI<\/a> have released a new model that leverages its multi-modal capabilities to improve the efficiency of its handling of complex documents and large blocks of text, by converting them into images first, <a data-analytics-id=\"inline-link\" href=\"https:\/\/www.scmp.com\/tech\/tech-trends\/article\/3329707\/deepseek-unveils-multimodal-ai-model-uses-visual-perception-compress-text-input?module=top_story&amp;pgtype=section?registerSource=loginwall\" target=\"_blank\" data-url=\"https:\/\/www.scmp.com\/tech\/tech-trends\/article\/3329707\/deepseek-unveils-multimodal-ai-model-uses-visual-perception-compress-text-input?module=top_story&amp;pgtype=section?registerSource=loginwall\" referrerpolicy=\"no-referrer-when-downgrade\" data-hl-processed=\"none\" rel=\"nofollow noopener\">as per SCMP<\/a>. Vision encoders were able to take large quantities of text and convert them into images, which, when accessed later, required between seven and 20 times fewer tokens, while maintaining an impressive level of accuracy.<\/p>\n<p>Deepseek is the Chinese-developed AI that <a data-analytics-id=\"inline-link\" href=\"https:\/\/www.tomshardware.com\/tech-industry\/artificial-intelligence\/deepseek-might-not-be-as-disruptive-as-claimed-firm-reportedly-has-50-000-nvidia-gpus-and-spent-usd1-6-billion-on-buildouts\" data-before-rewrite-localise=\"https:\/\/www.tomshardware.com\/tech-industry\/artificial-intelligence\/deepseek-might-not-be-as-disruptive-as-claimed-firm-reportedly-has-50-000-nvidia-gpus-and-spent-usd1-6-billion-on-buildouts\" rel=\"nofollow noopener\" target=\"_blank\">shocked the world in early 2025<\/a>, showcasing capabilities similar to those of OpenAI&#8217;s ChatGPT, or <a data-analytics-id=\"inline-link\" href=\"https:\/\/www.tomshardware.com\/tag\/google\" data-auto-tag-linker=\"true\" data-before-rewrite-localise=\"https:\/\/www.tomshardware.com\/tag\/google\" rel=\"nofollow noopener\" target=\"_blank\">Google<\/a>&#8216;s Gemini, despite requiring far less money and data to develop. The creators have continued to work on making the AI more efficient since, and with the latest release known as DeepSeek-OCR (optical character recognition), the AI can deliver an impressive understanding of large quantities of textual data without the usual token overhead.<\/p>\n<p><a id=\"elk-seasonal\" href=\"\" data-url=\"\" target=\"_blank\" referrerpolicy=\"no-referrer-when-downgrade\" data-hl-processed=\"none\"\/><\/p>\n<p id=\"675db11a-a885-43d1-9e96-2f07829361ff-2\">\u201cThrough DeepSeek-OCR, we demonstrated that vision-text compression can achieve significant token reduction \u2013 seven to 20 times \u2013 for different historical context stages, offering a promising direction\u201d to handle long-context calculations, the developer said.<\/p>\n<p>You may like<\/p>\n<p class=\"paywall\" aria-hidden=\"true\">The new model is made up of two components, the DeepEncoder and DeepSeek3B-MoE-A570M, which acts as the decoder. The encoder can take large quantities of text data and convert it into high-resolution images, while the decoder is particularly adept at taking those high-resolution images and understanding the textual context within them, while requiring fewer tokens to do so than if you just fed the text right into the AI wholesale. It manages this by dissecting each task into separate sub-networks and uses specific AI agent experts to target each subset of the data.<\/p>\n<p class=\"vanilla-image-block\" style=\"padding-top:44.19%;\">\n<p><img decoding=\"async\" src=\"https:\/\/www.newsbeep.com\/au\/wp-content\/uploads\/2025\/10\/3E8Lt5XU7UvV9jP7tAVy4Y.jpg\" alt=\"Deepseek tokenization pipeline.\"   loading=\"lazy\" data-new-v2-image=\"true\" data-original-mos=\"https:\/\/www.newsbeep.com\/au\/wp-content\/uploads\/2025\/10\/3E8Lt5XU7UvV9jP7tAVy4Y.jpg\" data-pin-media=\"https:\/\/www.newsbeep.com\/au\/wp-content\/uploads\/2025\/10\/3E8Lt5XU7UvV9jP7tAVy4Y.jpg\"\/>\n<\/p>\n<p>(Image credit: Deepseek\/<a href=\"https:\/\/ai-engineering-trend.medium.com\/deepseek-enables-ai-to-recognize-text-in-images-compressing-text-into-images-for-higher-efficiency-3fd93c4f7959\" target=\"_blank\" data-url=\"https:\/\/ai-engineering-trend.medium.com\/deepseek-enables-ai-to-recognize-text-in-images-compressing-text-into-images-for-higher-efficiency-3fd93c4f7959\" referrerpolicy=\"no-referrer-when-downgrade\" data-hl-processed=\"none\" rel=\"nofollow noopener\">AI Engineering\/Medium<\/a>)<\/p>\n<p id=\"5082d50f-aaa8-4194-9da2-a607c0903e63\">This works really well for handling tabulated data, graphs, and other visual representations of information. This could be of particular use in finance, science, or medicine, the developers suggest.<\/p>\n<p>In <a data-analytics-id=\"inline-link\" href=\"https:\/\/www.tomshardware.com\/tag\/benchmark\" data-auto-tag-linker=\"true\" data-before-rewrite-localise=\"https:\/\/www.tomshardware.com\/tag\/benchmark\" rel=\"nofollow noopener\" target=\"_blank\">benchmarking<\/a>, the developers claim that when reducing the number of tokens by less than a factor of 10, DeepSeek-OCR can maintain a 97% accuracy rating in decoding the information. If the compression ratio is increased to 20 times, the accuracy falls to 60%. That&#8217;s less desirable and shows there are diminishing returns on this technology, but if a near-100% accuracy rate could be achieved with even a 1-2x compression rate, that could still make a huge difference in the cost of running many of the latest AI models.<\/p>\n<p>It&#8217;s also being pitched as a way of developing training data for future models, although introducing errors at that point, even in the form of a few percent off base, seems like a bad idea.<\/p>\n<p class=\"newsletter-form__strapline\">Get Tom&#8217;s Hardware&#8217;s best news and in-depth reviews, straight to your inbox.<\/p>\n<p>If you want to play around with the model yourself, it&#8217;s available via online developer platforms <a data-analytics-id=\"inline-link\" href=\"https:\/\/www.scmp.com\/tech\/big-tech\/article\/3327205\/alibabas-qwen3-omni-tops-hugging-face-ai-ranking-chinese-open-systems-flourish?module=inline&amp;pgtype=article\" data-url=\"https:\/\/www.scmp.com\/tech\/big-tech\/article\/3327205\/alibabas-qwen3-omni-tops-hugging-face-ai-ranking-chinese-open-systems-flourish?module=inline&amp;pgtype=article\" target=\"_blank\" referrerpolicy=\"no-referrer-when-downgrade\" data-hl-processed=\"none\" rel=\"nofollow noopener\">Hugging Face<\/a> and <a data-analytics-id=\"inline-link\" href=\"https:\/\/www.scmp.com\/tech\/tech-trends\/article\/3214518\/microsofts-github-add-openai-chat-functions-coding-tool?module=inline&amp;pgtype=article\" data-url=\"https:\/\/www.scmp.com\/tech\/tech-trends\/article\/3214518\/microsofts-github-add-openai-chat-functions-coding-tool?module=inline&amp;pgtype=article\" target=\"_blank\" referrerpolicy=\"no-referrer-when-downgrade\" data-hl-processed=\"none\" rel=\"nofollow noopener\">GitHub<\/a>.<\/p>\n<p><a href=\"https:\/\/news.google.com\/publications\/CAAqLAgKIiZDQklTRmdnTWFoSUtFSFJ2YlhOb1lYSmtkMkZ5WlM1amIyMG9BQVAB\" id=\"1a5a6305-0d2b-44a5-97e1-4634188b1c0c\" data-url=\"https:\/\/news.google.com\/publications\/CAAqLAgKIiZDQklTRmdnTWFoSUtFSFJ2YlhOb1lYSmtkMkZ5WlM1amIyMG9BQVAB\" target=\"_blank\" referrerpolicy=\"no-referrer-when-downgrade\" data-hl-processed=\"none\" rel=\"nofollow noopener\"><\/p>\n<p class=\"vanilla-image-block\" style=\"padding-top:31.51%;\">\n<p><img decoding=\"async\" src=\"https:\/\/www.newsbeep.com\/au\/wp-content\/uploads\/2025\/10\/7cUTDmN2PHNRiNBVqbKf56.png\" alt=\"Google Preferred Source\"   loading=\"lazy\" data-new-v2-image=\"true\" data-original-mos=\"https:\/\/www.newsbeep.com\/au\/wp-content\/uploads\/2025\/10\/7cUTDmN2PHNRiNBVqbKf56.png\" data-pin-media=\"https:\/\/www.newsbeep.com\/au\/wp-content\/uploads\/2025\/10\/7cUTDmN2PHNRiNBVqbKf56.png\"\/>\n<\/p>\n<p><\/a><\/p>\n<p id=\"c15d3759-737b-44c0-8120-b5cb19ca5d70\">Follow<a data-analytics-id=\"inline-link\" href=\"https:\/\/news.google.com\/publications\/CAAqLAgKIiZDQklTRmdnTWFoSUtFSFJ2YlhOb1lYSmtkMkZ5WlM1amIyMG9BQVAB\" target=\"_blank\" data-url=\"https:\/\/news.google.com\/publications\/CAAqLAgKIiZDQklTRmdnTWFoSUtFSFJ2YlhOb1lYSmtkMkZ5WlM1amIyMG9BQVAB\" referrerpolicy=\"no-referrer-when-downgrade\" data-hl-processed=\"none\" rel=\"nofollow noopener\"> Tom&#8217;s Hardware on Google News<\/a>, or<a data-analytics-id=\"inline-link\" href=\"https:\/\/google.com\/preferences\/source?q=\" target=\"_blank\" data-url=\"https:\/\/google.com\/preferences\/source?q=\" referrerpolicy=\"no-referrer-when-downgrade\" data-hl-processed=\"none\" rel=\"nofollow noopener\"> add us as a preferred source<\/a>, to get our latest news, analysis, &amp; reviews in your feeds.<\/p>\n","protected":false},"excerpt":{"rendered":"Chinese developers of Deepseek AI have released a new model that leverages its multi-modal capabilities to improve the&hellip;\n","protected":false},"author":2,"featured_media":233469,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20],"tags":[256,254,255,64,63,105],"class_list":{"0":"post-233468","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-artificialintelligence","11":"tag-au","12":"tag-australia","13":"tag-technology"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/posts\/233468","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/comments?post=233468"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/posts\/233468\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/media\/233469"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/media?parent=233468"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/categories?post=233468"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/tags?post=233468"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}