{"id":5290,"date":"2023-04-26T18:29:54","date_gmt":"2023-04-26T18:29:54","guid":{"rendered":"https:\/\/redmonk.com\/jgovernor\/?p=5290"},"modified":"2023-04-26T18:36:49","modified_gmt":"2023-04-26T18:36:49","slug":"haptics-hallucinations-retrieval-augmentation-and-a-multi-model-llm-future","status":"publish","type":"post","link":"https:\/\/redmonk.com\/jgovernor\/haptics-hallucinations-retrieval-augmentation-and-a-multi-model-llm-future\/","title":{"rendered":"Haptics, Hallucinations, Retrieval-Augmentation and a multi-model LLM future"},"content":{"rendered":"<p>We all know what the industry\u2019s main character is right now &#8211; ChatGPT. But natural language processing (NLP) is in many respects a project as old as tech itself. A ton of companies are working on this stuff, some even before the current round of hype, with the attendant Great Pivot from Web3 to LLM. One such company is <a href=\"https:\/\/www.deepset.ai\/\">deepset<\/a>.<\/p>\n<p>Founded in 2018 in Berlin by Milos Rusic, Malte Pietsch, and Timo M\u00f6ller, deepset maintains the open source <a href=\"https:\/\/haystack.deepset.ai\/\">haystack<\/a> project, which is designed to make it easier to use Transformers and large language models (LLMs) in your applications. Transformers are a concept introduced by Google in 2017 in the seminal paper <a href=\"https:\/\/ai.googleblog.com\/2017\/08\/transformer-novel-neural-network.html\">Attention Is All You Need<\/a> &#8211; a neural network architecture that has dramatically accelerated the state of the art in AI\/ML. deepset wants to make this kind of technology usable and useful by the enterprise, with both on prem and cloud products. Because for all the excitement about LLMs and related technologies there is also a lot of fear, uncertainty and doubt. Who owns the models likely owns the moats. Enterprises and governments are concerned about ownership and business sustainability. 
Samsung recently had a leak of source code and trade secrets after <a href=\"https:\/\/www.techradar.com\/news\/samsung-workers-leaked-company-secrets-by-using-chatgpt\">engineering teams used ChatGPT<\/a> in a planning meeting. ChatGPT has been <a href=\"https:\/\/www.bbc.co.uk\/news\/technology-65139406\">banned in Italy<\/a> because of privacy concerns. So much for data protection &#8211; it\u2019s not clear whether the kinds of crawling and learning approaches pioneered by OpenAI are even compatible with EU law, in the shape of the General Data Protection Regulation (GDPR). Ant Stanley covers a lot of this in a great post on his new blog, <a href=\"https:\/\/precipitation.substack.com\/p\/ask-for-forgiveness-not-permission\">Ask for forgiveness, Not permission<\/a>.<\/p>\n<p>Anyway, when an area is so hot it\u2019s always interesting to talk to folks who are steeped in it. I was lucky enough to catch up with Pietsch recently, for a RedMonk Conversation video. It was funny that we both have stories about moms using ChatGPT. While I am not a fan of the \u201ceven my mom can do it\u201d framing, it\u2019s definitely worth paying attention when a technology is crossing over so fast to mainstream adoption. Conversational AI based on LLMs is \u201chaptic\u201d &#8211; the feedback loops are just very immediate. Insert Mythic Quest reference here.<\/p>\n<p>Mainstream adoption creates all kinds of challenges for the kind of innovation unleashed by OpenAI and ChatGPT. That\u2019s where data and model sovereignty, compliance, and the avoidance of AI-driven hallucinations in content, code and decision-making come in. Those are the kinds of areas where deepset is focusing its attention. What multicloud was to the last 10 years, multi-model probably will be to the next ten. 
We\u2019ve already seen <a href=\"https:\/\/aws.amazon.com\/blogs\/machine-learning\/announcing-new-tools-for-building-with-generative-ai-on-aws\/\">AWS start positioning itself accordingly<\/a>. Multi sounds good when you\u2019re not the market leader.<\/p>\n<p><a href=\"https:\/\/redmonk.com\/jgovernor\/2023\/04\/13\/the-great-flowering-why-openai-is-the-new-aws-and-the-new-kingmakers-still-matter\/\">OpenAI will be a winner<\/a>, but not the only one.<\/p>\n<p>A concept you\u2019ll be hearing a lot more about is Retrieval Augmentation &#8211; improving a model\u2019s answers by grounding them in retrieved documents. Again we cover that in the conversation. So dive in!<\/p>\n<p>Watch the video, and tell me what you think, here or on YouTube, but in the meantime I will leave you with a story from deepset about a gentleman in his 80s who runs a legal publishing firm in Germany. He called deepset just before Christmas last year to insist on a meeting before the end of the year to discuss ChatGPT\u2019s potential implications for his business, and how he could do something similar but without giving his own information away. ChatGPT only launched on November 30th 2022. That\u2019s the scale of the challenge, and the opportunity.<\/p>\n<p><span class=\"embed-youtube\" style=\"text-align:center; display: block;\"><iframe class='youtube-player' width='640' height='360' src='https:\/\/www.youtube.com\/embed\/4RkuR_humFE?version=3&#038;rel=1&#038;showsearch=0&#038;showinfo=1&#038;iv_load_policy=1&#038;fs=1&#038;hl=en-US&#038;autohide=2&#038;wmode=transparent' allowfullscreen='true' style='border:0;' sandbox='allow-scripts allow-same-origin allow-popups allow-presentation'><\/iframe><\/span><\/p>\n<p>disclosure: AWS, Google and Microsoft are all clients. deepset sponsored this video.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>We all know what the industry\u2019s main character is right now &#8211; ChatGPT. But natural language processing (NLP) is in many respects a project as old as tech itself. 
A ton of companies are working on this stuff, some even before the current round of hype, with the attendant Great Pivot from Web3 to LLM.<\/p>\n","protected":false},"author":5,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","footnotes":"","jetpack_publicize_message":"","jetpack_is_tweetstorm":false},"categories":[1],"tags":[],"class_list":["post-5290","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"jetpack_featured_media_url":"","jetpack_publicize_connections":[],"jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/p9wfjh-1nk","_links":{"self":[{"href":"https:\/\/redmonk.com\/jgovernor\/wp-json\/wp\/v2\/posts\/5290","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/redmonk.com\/jgovernor\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/redmonk.com\/jgovernor\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/redmonk.com\/jgovernor\/wp-json\/wp\/v2\/users\/5"}],"replies":[{"embeddable":true,"href":"https:\/\/redmonk.com\/jgovernor\/wp-json\/wp\/v2\/comments?post=5290"}],"version-history":[{"count":0,"href":"https:\/\/redmonk.com\/jgovernor\/wp-json\/wp\/v2\/posts\/5290\/revisions"}],"wp:attachment":[{"href":"https:\/\/redmonk.com\/jgovernor\/wp-json\/wp\/v2\/media?parent=5290"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/redmonk.com\/jgovernor\/wp-json\/wp\/v2\/categories?post=5290"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/redmonk.com\/jgovernor\/wp-json\/wp\/v2\/tags?post=5290"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}