{"id":204,"date":"2023-12-28T07:56:54","date_gmt":"2023-12-28T12:56:54","guid":{"rendered":"http:\/\/www.zaklab.org\/blog\/?p=204"},"modified":"2023-12-28T07:58:42","modified_gmt":"2023-12-28T12:58:42","slug":"embrace-your-inner-robopsychologist","status":"publish","type":"post","link":"https:\/\/www.zaklab.org\/blog\/embrace-your-inner-robopsychologist\/","title":{"rendered":"Embrace your inner robopsychologist."},"content":{"rendered":"\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p>And just for a moment he forgot, or didn\u2019t want to remember, that other robots might be more ignorant than human beings. His very superiority caught him.<\/p>\n<cite>Dr. Susan Calvin in &#8220;Little Lost Robot&#8221; by Isaac Asimov, first published in Astounding Science Fiction, 1947 and anthologized by Isaac Asimov in I, Robot, Gnome Press, 1950.<\/cite><\/blockquote>\n\n\n\n<p><em>Version 0.1 (Revision history at the bottom) December 28th, 202<\/em>3<\/p>\n\n\n\n<figure class=\"wp-block-gallery has-nested-images columns-default is-cropped wp-block-gallery-1 is-layout-flex wp-block-gallery-is-layout-flex\">\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"1024\" data-id=\"211\" src=\"https:\/\/www.zaklab.org\/blog\/wp-content\/uploads\/2023\/12\/neurorobot.webp\" alt=\"\" class=\"wp-image-211\" srcset=\"https:\/\/www.zaklab.org\/blog\/wp-content\/uploads\/2023\/12\/neurorobot.webp 1024w, https:\/\/www.zaklab.org\/blog\/wp-content\/uploads\/2023\/12\/neurorobot-300x300.webp 300w, https:\/\/www.zaklab.org\/blog\/wp-content\/uploads\/2023\/12\/neurorobot-150x150.webp 150w, https:\/\/www.zaklab.org\/blog\/wp-content\/uploads\/2023\/12\/neurorobot-768x768.webp 768w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n<\/figure>\n\n\n\n<p>When I was a doctoral student working on <a href=\"https:\/\/groups.csail.mit.edu\/medg\/ftp\/kohane\/MIT-LCS-TR-389.pdf\">my thesis<\/a> in computer science in an earlier heyday of artificial intelligence, if you&#8217;d ask me how I how I&#8217;d find out why a program did not perform as expected, I would come up with a half dozen heuristics, most of them near cousins of standard computer programming debugging techniques.<sup data-fn=\"c4b06a66-e2cc-4a4e-b0c1-b1e3dc1fd18a\" class=\"fn\"><a href=\"#c4b06a66-e2cc-4a4e-b0c1-b1e3dc1fd18a\" id=\"c4b06a66-e2cc-4a4e-b0c1-b1e3dc1fd18a-link\">1<\/a><\/sup>  Even though I was a diehard science fiction reader, I gave short shrift to the techniques illustrated by the expert robopsychologist\u2014Dr. Susan Calvin\u2014introduced into his robot short stories in the 1940&#8217;s by Isaac Asimov. These seemed more akin the the logical dissections performed by Conan Doyle&#8217;s Sherlock Holmes than anything I could recognize as computer science. <\/p>\n\n\n\n<p>Yet over the last five years, particularly since 2020,  English (and other language) prompts\u2014human-like statements or questions, often called &#8220;hard prompts&#8221; to distinguish them from <a href=\"https:\/\/openreview.net\/forum?id=MHWDdMEJ5s\">&#8220;soft prompts&#8221;<\/a><sup data-fn=\"d2abfbaf-4ed3-47dd-98e4-a5ac1ca9fecc\" class=\"fn\"><a href=\"#d2abfbaf-4ed3-47dd-98e4-a5ac1ca9fecc\" id=\"d2abfbaf-4ed3-47dd-98e4-a5ac1ca9fecc-link\">2<\/a><\/sup> \u2014have come into wide use. Interest in hard prompts grew rapidly after the release of ChatGPT and was driven by creative individuals who figured out, through experimentation, which prompts worked particularly well for specific tasks. This was jarring to many computer scientists such as Andrej Karpathy who declared <a href=\"https:\/\/twitter.com\/karpathy\/status\/1617979122625712128?lang=en\">&#8220;The hottest new programming language is English.&#8221;<\/a>  Ethan and Lilach Mollick are exemplars of non-computer scientist creatives that have pushed the envelope in their own domain using mastery of hard prompts. They have been inspired  leaders in developing <a href=\"https:\/\/papers.ssrn.com\/sol3\/papers.cfm?abstract_id=4475995\">sets of prompts for many common educational tasks<\/a> that resulted in functionality that has surpassed and replaced whole suites of commercial educational software.  <\/p>\n\n\n\n<p>After the initial culture shock, many researchers have started working on ways to automate optimization of hard prompts (e.g. <a href=\"http:\/\/arxiv.org\/abs\/2302.03668\">Wen et al<\/a>., <a href=\"http:\/\/arxiv.org\/abs\/2306.12509\">Sordoni<\/a> <em>et al<\/em>.) How well this works for all applications of generative AI (now less frequently referred to as large language models, and foundation models, even though technically they do not denote the same thing) in medicine in particular remains to be determined. I&#8217;ll try to write a post about optimizing prompts for medicine soon, but right now, I cannot help but notice that in my interactions with GPT-4 or Bard, when I do not get the answer I expect, my interactions resemble a conversation with a sometimes reluctant, sometimes confused, sometimes ignorant assistant who has frequent flashes of brilliance.<\/p>\n\n\n\n<p>Early on, some of the skepticism about the performance of large language models centered on the capacity of these models for &#8220;<a href=\"https:\/\/en.wikipedia.org\/wiki\/Theory_of_mind\">theory of mind<\/a>&#8221; reasoning. Understanding the possible state of mind of a human was seen as an important measure of artificial general intelligence. I&#8217;ll step away from the debate of whether or not GPT-4, Bard <em>et al,<\/em> show evidence of theory of mind but instead posit that having of theory of the &#8220;mind<sup data-fn=\"a286db72-ef58-4f9c-adad-ee9af222bb3b\" class=\"fn\"><a href=\"#a286db72-ef58-4f9c-adad-ee9af222bb3b\" id=\"a286db72-ef58-4f9c-adad-ee9af222bb3b-link\">3<\/a><\/sup>&#8221; of the generative AI program gives <strong><span style=\"text-decoration: underline;\">humans<\/span><\/strong> better results when using such a program.<\/p>\n\n\n\n<figure class=\"wp-block-gallery has-nested-images columns-default is-cropped wp-block-gallery-2 is-layout-flex wp-block-gallery-is-layout-flex\">\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"1024\" data-id=\"210\" src=\"https:\/\/www.zaklab.org\/blog\/wp-content\/uploads\/2023\/12\/robopsychology.webp\" alt=\"\" class=\"wp-image-210\" srcset=\"https:\/\/www.zaklab.org\/blog\/wp-content\/uploads\/2023\/12\/robopsychology.webp 1024w, https:\/\/www.zaklab.org\/blog\/wp-content\/uploads\/2023\/12\/robopsychology-300x300.webp 300w, https:\/\/www.zaklab.org\/blog\/wp-content\/uploads\/2023\/12\/robopsychology-150x150.webp 150w, https:\/\/www.zaklab.org\/blog\/wp-content\/uploads\/2023\/12\/robopsychology-768x768.webp 768w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n<\/figure>\n\n\n\n<p>What does it mean to have a theory of the mind of the generative AI? I am most effective in using a generative AI program when I have a set of expectations of how it will respond to a prompt based on both my experience with that program over many sessions and its responses so far in this specific session.  That is, what did they &#8220;understand&#8221; from my last prompt and what might that understanding be  as informed by my experience with that program?  Sometimes, I check on the validity of my theory of their mind by asking an open ended question.  This leads to a conversation which is much closer to the work of Dr. Susan Calvin than to that of a programmer. Although the robots had complex positronic brains, Dr. Calvin did not debug the robots by examining their nanoscale circuitry. Instead she conducted logical and very rarely emotional conversations in English with the robots. The low level implementation layer of robot intelligence were NOT where her interventions were targeted. That is why her job title was robopsychologist and not computer scientist. A great science fiction story does not serve as technical evidence or a scientific proof but thus far it has served as a useful collection of metaphors for our collective experience working with generative AI using these Theory of AI-Mind (?TAIM) approaches. <\/p>\n\n\n\n<p>In future versions of this post, I&#8217;ll touch on the pragmatics of Theory of AI-Mind for effective use of these programs but also on the implications for &#8220;alignment&#8221; procedures.<\/p>\n\n\n\n<figure class=\"wp-block-table\"><table><tbody><tr><td>Version<\/td><td><\/td><\/tr><tr><td>0.1 <\/td><td>Initial presentation of theory mind of humans vs programming generative AI with a theory of mind of the AI.<\/td><\/tr><\/tbody><\/table><figcaption class=\"wp-element-caption\">Version History<\/figcaption><\/figure>\n\n\n<ol class=\"wp-block-footnotes\"><li id=\"c4b06a66-e2cc-4a4e-b0c1-b1e3dc1fd18a\">Some techniques were more inspired by the 1980&#8217;s AI community&#8217;s toolit including dependency directed backtracking and Truth Maintenance Systems. <a href=\"#c4b06a66-e2cc-4a4e-b0c1-b1e3dc1fd18a-link\" aria-label=\"Jump to footnote reference 1\">\u21a9\ufe0e<\/a><\/li><li id=\"d2abfbaf-4ed3-47dd-98e4-a5ac1ca9fecc\">Soft prompts are frequently implemented as embeddings, vectors representing the relationship between tokens\/words\/entities across a training corpus. <a href=\"#d2abfbaf-4ed3-47dd-98e4-a5ac1ca9fecc-link\" aria-label=\"Jump to footnote reference 2\">\u21a9\ufe0e<\/a><\/li><li id=\"a286db72-ef58-4f9c-adad-ee9af222bb3b\">I&#8217;ll defer the fun but tangential discussion of what mind means in this cybernetic version of the mind-body problem. Go read <a href=\"https:\/\/www.amazon.com\/Am-Strange-Loop-Douglas-Hofstadter\/dp\/0465030793\" data-type=\"link\" data-id=\"https:\/\/www.amazon.com\/Am-Strange-Loop-Douglas-Hofstadter\/dp\/0465030793\">I Am A Strange Loop<\/a> if you dare, if you want to get ahead of the conversation. <a href=\"#a286db72-ef58-4f9c-adad-ee9af222bb3b-link\" aria-label=\"Jump to footnote reference 3\">\u21a9\ufe0e<\/a><\/li><\/ol>","protected":false},"excerpt":{"rendered":"<p>And just for a moment he forgot, or didn\u2019t want to remember, that other robots might be more ignorant than human beings. His very superiority caught him. Dr. Susan Calvin in &#8220;Little Lost Robot&#8221; by Isaac Asimov, first published in Astounding Science Fiction, 1947 and anthologized by Isaac Asimov in I, Robot, Gnome Press, 1950. [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":"[{\"content\":\"Some techniques were more inspired by the 1980's AI community's toolit including dependency directed backtracking and Truth Maintenance Systems.\",\"id\":\"c4b06a66-e2cc-4a4e-b0c1-b1e3dc1fd18a\"},{\"content\":\"Soft prompts are frequently implemented as embeddings, vectors representing the relationship between tokens\/words\/entities across a training corpus.\",\"id\":\"d2abfbaf-4ed3-47dd-98e4-a5ac1ca9fecc\"},{\"content\":\"I'll defer the fun but tangential discussion of what mind means in this cybernetic version of the mind-body problem. Go read <a href=\\\"https:\/\/www.amazon.com\/Am-Strange-Loop-Douglas-Hofstadter\/dp\/0465030793\\\" data-type=\\\"link\\\" data-id=\\\"https:\/\/www.amazon.com\/Am-Strange-Loop-Douglas-Hofstadter\/dp\/0465030793\\\">I Am A Strange Loop<\/a> if you dare, if you want to get ahead of the conversation.\",\"id\":\"a286db72-ef58-4f9c-adad-ee9af222bb3b\"}]"},"categories":[25,7],"tags":[27,46,48],"class_list":["post-204","post","type-post","status-publish","format-standard","hentry","category-machine-learning","category-medicine","tag-ai","tag-alignment","tag-robopsychology"],"_links":{"self":[{"href":"https:\/\/www.zaklab.org\/blog\/wp-json\/wp\/v2\/posts\/204","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.zaklab.org\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.zaklab.org\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.zaklab.org\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.zaklab.org\/blog\/wp-json\/wp\/v2\/comments?post=204"}],"version-history":[{"count":7,"href":"https:\/\/www.zaklab.org\/blog\/wp-json\/wp\/v2\/posts\/204\/revisions"}],"predecessor-version":[{"id":214,"href":"https:\/\/www.zaklab.org\/blog\/wp-json\/wp\/v2\/posts\/204\/revisions\/214"}],"wp:attachment":[{"href":"https:\/\/www.zaklab.org\/blog\/wp-json\/wp\/v2\/media?parent=204"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.zaklab.org\/blog\/wp-json\/wp\/v2\/categories?post=204"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.zaklab.org\/blog\/wp-json\/wp\/v2\/tags?post=204"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}