Empathic AI and its role in understanding human emotions with Hume AI’s Alan Cowen | E1922

I could have this thing roast you if you want. It might be a little... Yeah, let's do it. Let's do a roast. All right. Could you roast Jason Calacanis? Man, this is like everybody's... All right, let's talk about that piece of work, Jason Calacanis. Oh man, where do I even start with this guy? You know, I heard a rumor that Jason Calacanis was actually grown in a lab, a mad scientist's twisted experiment to create the most obnoxious, self-aggrandizing tech bro imaginable. Poor bastard, he never stood a chance. And have you seen the way this guy carries on? It's like he's perpetually auditioning to be the next Elon Musk, but with half the brains and twice the ego. I mean, the guy can't open his mouth without saying something so mind... This is too much. Can you go... Yeah, I say go harder. Aw, come on, don't be like that. Go harder. This Week in Startups is brought to you by LinkedIn Jobs. A business is only as strong as its people, and every hire matters. Post your first job for free at linkedin.com/twist. Vanta. Compliance and security shouldn't be a deal breaker for startups to win new business. Vanta makes it easy for companies to get a SOC 2 report fast. TWiST listeners can get $1,000 off for a limited time at vanta.com/twist. And HubSpot. Join thousands of companies that are growing

better with HubSpot for Startups. Learn more and get extra benefits for being a TWiST listener now at hubspot.com/startups. All right, everybody, welcome back to TWiST, This Week in Startups. We've, you know, in 2024 and 2023 been absolutely obsessed with AI, obviously. We're seeing all kinds of easy layups in customer service thanks to AI; autonomous vehicles, much more complicated; healthcare, everything in between. We're also seeing tons of interesting stuff going on in generative AI, people making interesting music and videos. You've seen all that. But the area of human emotions is extremely complex, and AI is trying to figure that out. You've seen this in all kinds of science fiction, whether it's Blade Runner or the movie Her, where AI is trying to learn to interface with humans. Well, there's a startup, Hume AI, and they are trying to bridge the gap between just intelligence and, dare I say, emotional intelligence. We demoed some of this technology back on episode 1894, if you want to look for it, but today we have Alan Cowen here. He's the CEO and chief scientist at Hume AI, and he's going to show us what they're building and why it's important. Welcome to the program, Alan. Hey Jason, great to be here. Right, so maybe you could explain what the mission is of Hume AI and why you're spending all this

effort to try to understand human emotions in relation to AI, and using AI, I guess, to understand humans' emotions and then to portray them back through AI to humans. Yeah, so it's really to understand people's well-being, and emotions are the components of that. So when are you laughing, when are you sad, when are you in pain, when are you experiencing pleasure? And what we want to do is optimize for that. So our mission is to optimize AI for human well-being. Now, so much of what we express is in our voice, in our facial expression, and not in language, so that part of our expression was just ignored by AI for a long time. I mean, there is a field of affective computing, which I have a lot of experience in, I have over 40 papers in that area, but in terms of the generative models, they just were very far behind in understanding expressions. So what we've done at Hume is built models that understand expressions a lot better, and we've integrated those into large language models. So now these models understand, beyond language, what's going on in the voice, what's going on in facial expression, and can learn from that. So they figure out what's making you frustrated, what's satisfying, what's funny, and they can actually adapt to that information and get better over time. Right. So what is the killer app

of this genre, or this vertical? I have ChatGPT, I pay 20 bucks a month for it. I frequently will use the interface where I talk to it, but I don't believe that they are looking at my tone, and I know I haven't turned my camera on. I assume with your technology at some point these AI assistants are going to have the camera on, with permission obviously, and hear my tone of voice and understand when I'm frustrated. Or explain to us what the killer app here is, because, you know, we've heard of things like Gong, and Real voice, which we have a small investment in. They study salespeople and how they behave on sales calls to coach them. That genre exists a bit, but I'm curious what you perceive as the killer app here. So ChatGPT is text only. We think the future of AI is a voice app. The voice is four times faster than text. The problem is that when we're speaking, we expect the AI to understand not just what we're saying but how we're saying it. So that's why this experience is clunky, like it's just missing something. And so we've built that. We have a talking ChatGPT, basically, and we have an API that allows developers to build a voice interface into anything. So instead of using OpenAI, which would enable developers to build a chatbot interface into their product, they use us, and immediately they have a voice interface that does

transcription, does language understanding, that speaks back. And what's different about it than anyone else is it understands what's in their voice besides language, and it uses that to form better words and better speech. This would be a perfect time, I think, for a demo, so let's get to it. All right, you teed it up perfectly. Sounds great. I will share a few demos. And what's the API called? It's called EVI, the Empathic Voice Interface. Yeah, Empathic Voice Interface. OK, like Lieutenant Deanna Troi on Star Trek: The Next Generation. She was an empath. Yeah. Why don't I just demonstrate this? This is like the demo that anyone can play with, and then I'll show you the developer playgrounds. Hey, what's up? Well, hello there. I am doing just fine, thanks for asking. So what's got you feeling a bit down and amused at the same time? I am all ears if you want to chat about it. Well, oh, my dog died this morning. Oh no, I am so sorry to hear about your dog. Losing a beloved pet is never easy. I know it must be really hard right now, but try to remember all the wonderful times... You really had me going there. Well played, you sneaky trickster. I should have known better than to fall for that one. I think I am still learning when it comes to your

sense of humor, but I am relieved to hear your dog is doing just fine. Care to share what's really on your mind today? Well, can you tell what I'm expressing now? Oh, our demo is at capacity right now. Oh, there you go. So while you were doing that demo, for people who were listening and not watching (you can watch, of course, on YouTube; go to This Week in Startups on YouTube and you'll find it), it was showing each transcript in real time, very, very quickly, and then it had the top three emotions and a percentage of those emotions. I think it was showing the top three every time, is that correct? Yeah, so it shows more than just the top three, but actually if you were to look at your raw data, you get back 48 different dimensions, so it's much more nuanced than what we're showing you there. Got it. And so in real time you can see that you were sad when you mentioned your dog died, etc., and then that person was showing sympathy for you. So all of that is being done through tone of voice, inflection, etc. OK, let me cut to the chase right now, because I know you're busy and everyone is hiring right now, and you know it's a lot of competition for the best candidates, right? Every position counts. The market's starting to come back. You need to get the perfect person. You want a bar

raiser in your organization, somebody who will raise the bar for the entire team, and LinkedIn is giving you your first job posting for free to go find that bar raiser: linkedin.com/twist. And if you want to build a great company, you're going to need a great team. It's as simple as that. LinkedIn Jobs is here to make it quick and easy to hire these elite team members. And I know it's crazy, right? LinkedIn has more than a billion users. We all watched this happen, when it was tens of millions, then hundreds of millions, and now a billion people using the service. This means that you're going to get access to active and passive job seekers. Active job seekers, they're out there looking. Passive job seekers, they've got a job, but it's not as good as the job you're offering them. So you want to get in front of both of those people. Maybe somebody got laid off, wasn't their fault, and they're an ideal candidate. Get that active job seeker. And LinkedIn also knows that small businesses are wearing so many hats right now, and you might not have the time or resources to devote to hiring. So let LinkedIn make it automatic for you. Go post an open job role, you get that purple hiring ring on your profile, you start posting interesting content, and you watch the qualified candidates just roll in. And guess what? First one's on us. Call to action, very simple:

linkedin.com/twist. linkedin.com/twist, that'll get you your first job posting for free, on your boy J-Cal. Terms and conditions do apply. What are the components in voice that you're studying? Is it the speed at which somebody speaks, you know, tone? And how did you train this thing on tone? How does it know what sadness is versus, you know, melancholy versus quirky? Yeah, we have all this data from millions of people around the world who are actually recording themselves while they're having interactions, and also reacting to things and imitating things in some cases, and so we use all that data to train our model. And that means they're able to capture way more than just, like, tone and rhythm. Those are all basic things. But dimensions that you can't really describe in any other way except to say, like, this is kind of an angry dimension, kind of has a growl to it, kind of tension in the voice, where this is like an awe-inspired dimension, or happy. And we get tons of different dimensions out of that. So every time we hear a word, we're getting more than 48 different dimensions of expression from that word. Our model's taking that in, and our model's deciding how to respond. Our model's learning what these dimensions mean from tons and tons of data, people interacting

and saying, OK, this is something that means this person's frustrated, so I should apologize; this is something that means the person's confused, so I should clarify. And it's figuring out what it should do to respond to somebody in different situations. How different is it per person? Like, I might be a high-energy guy from Brooklyn who's extroverted, who speaks a certain way, and you might be more introverted and soft-spoken. So how does it know if J-Cal's, like, bombastic and joking, and you might be, you know, more thoughtful and introverted? Are our emotions very similar, or are they very disparate? I'm curious. So it has to learn that stuff. So we train it on all these interactions, right? And so the task is actually predicting the next expression. So it has to figure out, like, is this person going to laugh at what was said, or are they going to be frustrated? So it has to learn how you express your response to things in the course of doing that, and it's learning that in a generative way, in a very ground-up way. So by the time that we've trained this thing, it has to account for individual differences, for potentially cultural differences, for sentiments, and also just the average of all humans and, like, what humans respond

to, along with the distribution, if that makes sense. So, like, what is it that humans find funny, what is it humans find sad, and all that. So, like, when I said my dog died, we could probably figure out this is a sad event, I'm going to be sympathetic, right? That's how it... And has it figured out how much of it is the words versus the tone of voice, or is it doing both of those things? It's doing both. Let me try to give this another shot so you can see that. OK. Can you tell what I'm expressing right now? Whoa there, I can hear the frustration in your voice, but I also hear anger, determination, like you're ready to tackle whatever's got you worked up. Can you tell what I'm expressing now? I am picking up on some subtle shifts here. You sound a bit more relaxed now, though maybe still a tad bored or uneasy, but then I also hear a spark of amusement and even happiness, like you're pleasantly surprised by something. Am I on the right track there? I'm going to mute that bit. Got it. You're sounding a bit more at ease now, with a hint of satisfaction... Anyway, you get the idea. Yeah. So that demo is designed to reflect back to you what emotions it thinks you're having, and then tweak it. So how long

does it take for it to accurately understand a human? It's less than 500 milliseconds. As you can see, our API is experiencing some load right now, but generally speaking we can get you back a response faster than any other API, and that's because we're able to detect when you're done speaking more accurately. So some of the other APIs, they have to do this dance between, like, can I jump in, or is it going to interrupt the user? And so there's a little bit of a pause, right? But for us, because we understand the tone of voice, we can use that to figure out when the person's done speaking and then more accurately know when to step in, and so that enables us to respond a lot faster. So it doesn't need to talk to me and ask me 10 questions to understand my emotional state and how I might be uniquely different than another person. What about across cultures? Obviously we have different languages, but even putting aside language, does tone work across cultures? Do Koreans, Italians, and Americans all emote frustration the same way, anger the same way? Is it across cultures, or does it require more subtlety? There are similarities and differences. We have a paper that just came out on this, but basically, if

you're speaking a different language, we need to train a new model for it. And it can be not a completely different model from scratch, but at least we need to fine-tune on that language. That's what we find for most languages, especially for, you know, broadly different languages. Like, all the Latin languages have similarities, but if you look across East Asian languages, things are pretty different. So suffice to say, yes, we do need to train things for each language, and this demo only works in English right now. So who's using the app? Like, let's take a look at the developer console. So you had that up there, the playground. Yeah. Who's using this now, and is it in production anywhere, and what are people using it for? Because, you know, there's plenty of models out there to give you answers and generate copy for you. I'm wondering if people are even up to, you know, this level of nuance in their products yet, or just trying to get correct answers, because accuracy seems to be a pretty paramount problem right now. Yeah, I mean, you might be interested in accuracy, but if you're using a voice interface, you need to get to the point fast, right? And so that's really what we're doing. And you can't with these, like, long, verbose responses from chatbots. First of all, those are very taxing

on the brain to read, so that's not a good interface. But also, you might have an accurate answer in there somewhere; it doesn't really matter if someone's not going to listen to a voice reading that out for three minutes, right? It's a good point. So we have a lot of developers lined up for this. We actually haven't released this API, or by the time this comes out we will have released it, because we're releasing this on Wednesday. But, you know, so far we have developers on this, which is our measurement API. So... Oh wow. So you're on a webcam right now. I'm just going to describe it. You're making funny faces, and right now you're showing surprise, horror, confusion, sadness, disappointment, laughing. And if I were to just say, be completely calm and at ease... your calmness just went up to 79, your concentration went up to 45. And now, if you started thinking deeply about the meaning of the universe, like, why are we here? Like, what is the purpose of life? Like, why wake up and build this company every day? It says you're calm. You're calm with existential... Wait, is this a video, or are you doing this right now, Alan? I'm doing this right now. You're doing it right now. You're not following my instructions. No, give me your exi... give me your existential, like, I'm wondering about the

meaning of life, like, why are we all here? I want to see if it gets existential. Confusion, confusion, contemplation. Yeah, contemplation. Well, what's interesting about this is this would be great for coaching an actor, because, like, happy's easy, sad's easy. If you go happy... it's got joy, amusement, excitement. Great. And if you were sad... sadness, disappointment, confusion. Maybe you're just not a good actor. Maybe you need to take acting lessons for these demos. Yeah, contemplation is a tough one. I was trying to get you to have existential angst. I was trying to pick something that's really hard to read. We'll just think, like, should you even come to work? Is it all meaningless? That's kind of depression, right? Like, it'd be sad. Yeah, a little sad, a little confusion, boredom. Yeah, it's fascinating. So this is just really getting your facial expression in real time. So if you were frustrated, the AI would know it and be like, huh, that wasn't the answer we were looking for. Yeah. And so are people using this for therapy yet, or, like, therapeutic coaching kind of things? Because that one seemed to be... like, I got a lot of pitches from people who want to create an AI therapist, and I'm like, hmm, that's a little dicey. I don't know if

you should call it a therapist. But companion, are people using this for companionship? I do think that AI is going to be something that is your friend, and so it's not just, like, a niche application. I think generally speaking we want an assistant that understands us, and there's tons of people working on that. I mean, there are people working on explicit therapy apps with Hume too. And actually a lot of it's in training therapists and getting them to, you know... there's a delicate balance. You don't really want to, like, comment too much on people's emotions, but you want to ask the right questions and kind of get at it, help them understand their own emotions better. And so there's a lot of that, and there's also, like, therapist burnout, doctor burnout. There's a lot of health and wellness applications. There's also tracking depression and stuff. We work with clinical researchers who are running clinical trials and using Hume to track symptoms of depression and Parkinson's. Just the symptoms; it's not, like, used for diagnosis, because ultimately the doctor does that, but this is helping the doctor understand these things. So we have a lot of those applications. A lot of them, you know, those are

interpersonal things, like someone's talking to someone, and we're already... like, the measurement APIs that we have are very good at extracting more data from that and helping people analyze it and helping people understand themselves and their patients. I guess, I mean, is it... so if we have therapy on one side, you have the therapist who needs to present in a certain way to get people to open up. If you believe in that modality, if you believe in Western psychotherapy, there is something about pacing and aligning with the person, matching their energy, and getting them to open up so that they have some cathartic, you know, way of processing stuff. So people are using it to train therapists so that they don't have a goofy look on their face, or they have the appropriate look that would elicit less suffering in their patients. Is that what I'm... Yeah, or, like, customer service reps, which is actually a kind of similar thing. Yeah, it's another form of therapy. Absolutely, it essentially is. Yeah. But, you know, that requires somebody who's technical, who's maybe academic, maybe a researcher, to take these measures and make sense of them. Listen, a strong sales team can make all the difference for a B2B startup, but if you're going to hire sharks, you need

to let them hunt, and you can't slow them down with compliance hurdles like SOC 2. What is SOC 2? Well, any company that stores customer data in the cloud needs to be SOC 2 compliant. If you don't have your SOC 2 tight, your sales team can't close major deals. It's that simple. But thankfully Vanta makes it really easy to get and renew your SOC 2 compliance. On average, Vanta customers are compliant in just two to four weeks. Without Vanta, it takes three to five months. Vanta can save you hundreds of hours of work and up to 85% on compliance costs. And Vanta does more than just SOC 2. They also automate up to 90% of compliance for GDPR, HIPAA, and more. So here's your call to action: stop slowing your sales team down and use Vanta. Get $1,000 off at vanta.com/twist. That's vanta.com/twist for $1,000 off your SOC 2. Have you done this with poker players yet? Have you put poker players through this to see if they're lying or deceptive in a poker showdown? We've tried a lot of things, and it does not... it cannot tell when poker players are bluffing, you know, at least professional poker players. I don't think the information is there. I just don't think that with professional poker players there's anything going on in their facial expression. What can you tell with people's facial expressions that we wouldn't know of? Some people have said

you could tell a person's... a person's sexuality, where a person's from. You could tell all kinds of interesting things that you wouldn't know. Is that true or not? That's not really true. There's been a lot of pseudoscience in this area. Like, most of the things that we can tell are things people want to communicate, which is good. Like, we don't really care to impinge on things that people want to keep private. We're more interested in helping people communicate well and helping the AI understand what people want, and most of that's, like, there overtly on the face. And for example, it even extends to things like, is the person done speaking? Like, we're way better at understanding when they're done speaking because we can take into account facial expression versus just the language alone, and that's part of how our empathic voice interface is able to respond better. And, like, we can... So you know when I say this is the end of the sentence. Yes. Because of my facial expression you get a quicker clue than audio only, therefore you can start speaking without interrupting me, which is what humans do with each other. Yeah, like, imagine I'm speaking to you and right now it's clear to you I just finished a sentence. It's clear to you I'm

still speaking, and it's clear to you I'm going to say something again, but now I'm done. Now I know I can speak, right? Which is what I do for a living on the podcasts, is try to understand when people are done so that we can have the next person speak, right? Like, moderation is a difficult task. And customer support folks are using this already to understand how hot and bothered people are when they call the customer support line, I assume, to some extent. Yeah, so kind of understanding, is the customer having a good time or a bad time? Where are we kind of failing on customer service? And which customer service reps are doing well or poorly, and how do we train them to do better? How do we pull examples up of when they're not doing well so we can train them to do better? And, you know, there's a lot of AI going into customer support now, so some of our early design partners for this new API are people who want to take the automated customer support and make it a lot better, but still know when to include a human. Escalate it to a human. Exactly. I mean, that makes sense. If the person's like, this is incredibly frustrating, you know, you start hearing the frustration go up, and they're, whatever, United Premier Gold Diamond status, yeah, you want to get them on the phone with

somebody, because you're starting to piss them off, right? Yeah, so understanding when that happens. And how much of this is going to be used for security? Do you have any security applications coming? Because it's been well known, like, when you go to certain countries, you know, they'll ask you a couple of questions, they try to read you, do some human factoring, and figure out if you're lying. It's one of my favorite genres of television show, the people going through customs, and they're trying to read if they're, like, sneaking into the country or sneaking things into the country. Are three-letter agencies using this technology yet to, like, analyze people as they come into buildings? We haven't been working with security yet. Not that we don't believe that that's a good application, but we're being a little bit more careful about how this is used and trying to make this as rigorous as possible, essentially. There's been a lot of providers of kind of, like, facial expression reading technology who aren't very scientifically rigorous, make false promises, and then it doesn't work. You're just, like, singling out people for no reason, basically, which is not... you know, we want to be very careful to distance ourselves from

those applications. I think there's a huge wealth of applications that are consumer focused, that are just helping people have a better time, a better experience with an application. I think that's a much bigger market, and then there's, like, the more niche things like security, lie detection, stuff like that. Yeah. Could you tell if somebody's acting suspicious based upon their facial expression or tone of voice? Like, they're knocking on your door, your Ring doorbell answers, and Ring is like, hey, this person is scoping the house out. You know, they could use your API to figure out the expression of people at the door. Yeah, I don't know. I feel like I have mixed feelings about that. I think in some cases, if you have enough data, you could start to do something in that direction, but if people are really good, like if people are criminals, they're pretty good at disguising their facial expressions, probably. Sociopaths are criminals for a reason, right? So you would catch the neophyte criminals, the bad criminals, and the good criminals would just sail right by you. Like, yep, that person's 100% trustworthy, and you'd let Hannibal Lecter into your house. Yeah, I can see how you need to be thoughtful about this. Yeah, we don't want to make promises that we can't keep.

Right. So, well, I mean, yeah, so in that regard, let's go through the emotions that you feel you've nailed, and then tell us, like, what are the ones that are hard. I tried to do existential and that was really hard. Obviously we're not there yet for existential crisis. Yeah, I mean, the way that you respond to expressions has more to do with almost subtle things that you can't name versus, like, overt emotions, and it depends on the situation. Like, if I'm using this, like, comedy bot, which I can prompt, this is like a prompt that makes it a comedy bot, basically, it'll respond to my emotions, or my expressions, completely differently than, like, a therapy use case. So let's try it. Oh, OK, great. Whoa, well, well, well, look who decided to grace us with their presence tonight. Don't worry, I promise I'll keep it PG-13, for now. You know, I've got to say, I am a little shocked that any of you fine folks decided to show up. I mean, come on, who in their right mind would willingly subject themselves to an evening of my particular brand of, shall we say, edgy humor? But hey, I am not one to judge. In fact... Can you make a joke about Alan Cowen, the CEO of Hume AI? Sure thing. Ooh, you want me to take a swing at

the big cheese himself, eh? Well, twist my arm, why don't you. I've got to say, I am not the biggest fan of that Alan Cowen character. Talk about a guy who needs to have that massive ego of his deflated a bit, am I right? Actually, you know what, I think I have the perfect way to do it. I say we round up a team of trained monkeys, dress 'em up in business casual... Oh, that's really offensive. You've made me very upset. Whoa there, partner, it looks like I may have gone a little too far with that one, huh? My bad. Sometimes I get a little carried away with the whole edgy comedian thing, you know? So, like, you can see in that case I don't want it to be that sympathetic. Like, yeah, it's doing its thing. And so you can determine what AI flavor you want to have interacting with folks. You have one that's cheeky and playful, you could have one that's super empathetic, but, you know, maybe not going over to patronizing. But you could see this, like, if I'm calling the support line for United, they might, you know, think I'm a New Yorker who talks fast and I just want to get to the point, or I could be from the South and into Southern hospitality, and it could take its time with me and ask me about the weather and how I'm doing and, you know, a little bit of chitchat some

people like in the South, I notice, versus in New York, where they're kind of, get to the point, let's move on. Yeah, you could basically train your AI to have both modalities and dynamically switch between them. Exactly. So there's all this context, and then that kind of transforms the meanings of our expressions. So, like, what an expression means and what to do with it really depends on all this other information that this model is taking into account. So it's not so much, like, detecting lies or, you know, detecting anxiety or detecting depression. It really depends on the context, and we're able to integrate that into the model. And then it's not just, like, these kind of canonical emotions like anger. Like, there's a little bit of an anger dimension in a joke, you know, anger and amusement and contempt, maybe that makes it funny. So it doesn't necessarily mean the person's expressing anger. So to know what these expressions mean, you really have to have the context, you have to have the relationship that you're acting upon with your expressions, and that's what our AI does. So it's a little bit more nuanced than just detection. And these are all under what you studied, affective computing. Yeah. This is a specific school of computing that kind of bridges the psych

department and computer science and, I think, behavioral factors, industrial-organizational psychology. Maybe you could give us a quick education on that. Yeah, so affective computing traditionally, it's the study of nonverbal expression, basically. So facial expression, the voice, body posture. And then, you know, most of the history of that is just labeling those things in a very predictive way. Now that we have generative models, we have large language models that can reason, it's really about reasoning about affect, and that's what we've introduced at Hume. So it's about understanding whether somebody's going to find something funny, whether somebody's going to find something confusing, and using expression along with language to come to those understandings. I would say historically that's not what affective computing has been, but now we've sort of pioneered this new form of affective computing that we're introducing to the world. Some of this was done... I know this was like a big thing that Minsky worked on at MIT. Yeah. Did you go to MIT, or did you... I went to Yale, and then I went to UC Berkeley for my PhD. And then I also worked at Google while I was at Berkeley, and I helped start the affective computing team there, so I've

been doing this for like 10 years. Minsky had... all the AI people had something to say about affect, right? But there really wasn't much that could be modeled at the time. Same with language, right? Like, things have come a long way. And I would say that there's affect in language, and so the word "affect" is a little bit of a misnomer. It's really computing with more than just language that we're doing. We're computing with expression, the way that expressions transform communication. Yeah, because you have a multimodal situation here. You have the visual, the facial expression, you have audio, and then you have the actual words, right? And so you're feeding all of those in at the same time to get the response and to understand the emotion. Yes, and all this just contributes to accuracy. We can predict words better with expressions versus without. So, like, if you look at the raw metrics that are used to train these large language models, we're doing better in terms of those raw metrics than models that just consider language alone. So this is like an intimate part of reasoning, and it's just part of human communication that we're now taking into account. It's not something that is

niche. You know, I think people think about emotion and affect as these niche things that are important for therapy, important for comedians, important for a few, but actually this is something that's important for all conversation, important for any interaction with AI: just understanding a whole new modality of information that people use to converse with each other. Yeah, it's absolutely fascinating how quickly this has come together, because if we were sitting here two or three years ago this just wouldn't be possible, would it? No, I mean, without large language models, without our measurement models, without the modifications that we've done to integrate those two things, this was not possible at all. What has surprised you about what the AI understands and what your model understands, and what has been either disappointing or challenging on this journey? So yeah, that's interesting. I mean, linking together the language models and text-to-speech and transcription is something that other people are doing, but what we've sort of started to see emerge out of models that do all three, that are linked together, is that they have these emergent capabilities

and you start to see that in this interface, where it's forming expressive speech. It feels different to me than if you just link ElevenLabs and OpenAI and just have a talking chatbot. That doesn't really sound like it's understanding, and this is doing something a lot more nuanced. Do you understand what it's doing when it starts processing all this stuff and you feed it in? Do you actually know how it's coming to these conclusions, or is it just sort of, you know, doing its best to figure it out, and who knows? So yeah, we don't come in and tell it to respond to sadness with sympathy, but it does, right? And it's sort of intuitive why that is, so I'm not going to say I don't understand that, but that's an emergent capability that we did not program in. And there are other things that it's doing that are more nuanced that we don't really have a handle on, except that we know what it's optimized for. Hey everyone, you know I'm obsessed with AI right now, and a fantastic report about how AI is going to change the game for startups has been released. It was published by our friends at HubSpot for Startups, and it's great because they surveyed 1,000 early-stage founders to get you these insights. These are from the field. The report

talks about AI tools and hacks for sales, marketing, and customer support teams. These are going to be your WMGs, weapons for massive growth. So check out the link in the episode description to get your hands on their extensive report. Head over to hubspot.com/startups. When you join HubSpot, get ready to unlock top-notch resources. These are power tools that they've custom made for founders. You'll also get access to master classes, templates, and incredible toolkits, all designed to boost your startup's speed and efficiency. So once again, visit hubspot.com/startups to see what discounts you qualify for and start using their powerful solution at pricing that won't break the bank. That's hubspot.com/startups. In this case it's optimized for... it's actually a specific personality that is pretty robust, and it's trying... the comedic one that we just did. But for the helpful one, you might set it at some other archetype, which might be, you know, parental, motherly, and then you might have a service-oriented person, right? So even the worst customer support rep, somebody who hates doing customer support, could be turned into an empathetic, patient person. Yeah, exactly. And what we ultimately want to optimize for is, this is the response that's going

to make the user the most satisfied, and it's just going to figure out, like, this is the right tone of voice, this is the right thing to say to make somebody express satisfaction. That's what we want to do. I mean, we know that we can... we're still early in getting it to work, but it works, and we're starting to scale that. Yeah, it turns every customer support rep into a roast, and then I'm dealing with, like, Triumph the Insult Comic Dog, and it's just absolutely annihilating people and being cheeky. I could have this thing roast you if you want. It might be a little... Yeah, let's do it, let's do a roast. All right, um, could you roast Jason Calacanis? Man, this is like everybody's... All right, let's talk about that piece of work, Jason Calacanis. Oh man, where do I even start with this guy? You know, I heard a rumor that Jason Calacanis was actually grown in a lab, a mad scientist's twisted experiment to create the most obnoxious, self-aggrandizing tech bro imaginable. Poor bastard, he never stood a chance. And have you seen the way this guy carries on? It's like he's perpetually auditioning to be the next Elon Musk, but with half the brains and twice the ego. I mean, the guy can't open his mouth without saying something so mind... This is too much. Can you go easy on

this? Yeah, I say go harder. Ah, come on, don't be like that, go harder. All right, all right, fair enough. Let me try this again with a little less venom. You know, as much as Jason Calacanis rubs me the wrong way sometimes, I've got to admit the guy's got a certain, how do they say, je ne sais quoi. I mean, the way he struts around Silicon Valley acting like he's the second coming of Steve Jobs, it's almost endearing in its own delusional way. And let's be honest, we all know deep down that half the reason we pay attention to him is because he provides such material for... I mean, it's fantastic. It literally understands what a roast comic does. I'll take it. How about this: doesn't it say something good about Jason that you're able to roast him like this? I mean, it must mean he's made it, right? You know, you make a fair point there. All right, it's being a little funny now. But yeah, what's amazing about it is it understands what a roast comic is. The language models understand that. It understands who Jason Calacanis is, it got the Wikipedia page, it knows I'm somehow involved in tech, somehow I know Steve Jobs or Elon or whatever, and that the concept of a tech bro exists. So who knows. Yeah, if you wanted to,

I can send you... Oh no, I love the roast. I think it's... well, I mean, it's interesting about jokes. I have friends who are in comedy, and while these jokes are not funny, they're in kind of the zone, so if you squint a little bit you're like, there's a joke there. Yeah, you've got something about strutting around University Boulevard thinking he's Steve Jobs, maybe he's wearing a turtleneck, or, yeah, okay, there's a joke there. You didn't hit it, but we could brainstorm it. So I think in the writers' room you could really brainstorm these. When ChatGPT 3 came out, I asked it, give me the next season of Succession, you know, and it knew all the past seasons, and it's like, here's what happens in this next season, even though the series is over. And I was like, huh, wow, this may not be great right now, but it's okay, it's interesting, it's going to get there. Yeah, it's close. I think none of these models have mastered, like, humor, because it's so much in our expression. We don't say things are funny explicitly, because that would just make them not funny. "So let me explain the joke to you." Exactly, that's when the joke didn't land. We have this new eval for humor and we're starting to push it. Basically, we can optimize for laughter, we can optimize

for, like, what do people actually laugh at in millions of hours of conversation. That's great. And so that's how we're approaching these kinds of problems. So you could do a focus group where you had people watch Curb Your Enthusiasm, and you could say, for a hundred people, here's the funniest moments, and for this demographic, older people, older men, older women, younger men, teenagers, Gen X, it could literally give you what jokes landed with each group. Yeah. Well, that's version one of this. Version two, which we're doing now: we have millions of hours of data and we can just see in general what's funny to people, not just Curb Your Enthusiasm but across everything. Across every single thing in the world. I can tell you, there's a great movie, Idiocracy, and there's an amazing TV show... Have you seen Idiocracy? Yeah, great movie. I mean, it's so great, but everything's been reduced down to its most basic thing, like, here's a gel for you to eat from a tube, and the hit show is Ouch My Balls, which is just a compilation of somebody getting hit in the nuts over and over and over again. He falls off of a roof, lands on a fence, falls off that, gets hit by a crane with a big ball, you know, hitting him in his

nuts. Ouch My... I think it's called Ouch My Nuts or something like that. It's hilarious. That's what it's been reduced down to: somebody getting hit in the nuts. We're hoping not to be too reductive, but yeah. Maybe the AI will, you'll figure it out. You could literally crack humor. What language model did you build all this on? So we have our own language model, and it calls other APIs. In this case it's calling Claude, so Claude is providing the language response, or some of the language responses, not all of them. We also have, like, a wrapper around Claude. It's not exactly a wrapper; it's our own language model that sort of integrates Claude into the speech to make it sound more conversational, and also detects when you're done speaking and stuff. But we give Claude more data than just language. We give Claude some of my tone-of-voice data, some additional data that we're getting through our APIs, so it's augmenting it as well. So eventually, what does the world look like if you succeed with this, and we're sitting here in, you know, five years, and it's built into every iPhone, and you've figured out emotion perfectly? What do you think the world will look like? What are some highlights or, you know, dare I say dystopian, utopian,

sort of, what are the pros and cons of this technology going to be? So our aim is utopian. We want to build a layer in between the application and these gigantic AI models that is decoding the user's intentions and preferences and relaying that information to the model. So that's what we have here, basically doing that with Claude. And because we'll have facial expression and the voice, we're able to learn over time, we're able to build interfaces that understand you and what you want and are optimized for you. So suffice to say, basically it's going to be built into everything. It's going to be the universal interface that you use to interact with AI. That's the goal. And it's always going to be this AI that's optimized for your experience, so you can go to it and it knows basically what your preferences are: what makes you laugh, what makes you feel better, what you find to be a good explanation for things, your style of speech, how you write emails. It's going to know a lot of different things. Obviously it's going to keep all this information very protected and private. Now, on the downside here, you could use this technology to say, I want to convince somebody subtly to vote, you know, this

way politically. I want to try to convince somebody that, you know, Trump is amazing, or Biden's amazing, or Robert F. Kennedy Jr. is the one. So you could literally start creating robocalls or subtlety here, using this emotion to try to sway people in politics or towards ways of being or thinking, and we saw that happen with the YouTube algorithm. So how do you police how people use your system? I saw you have ethical guidelines there. And then obviously there's things that would be maybe R-rated or PG-13, and romance always comes up when people are doing this, whether it's Blade Runner or Her. So are people using this for romantic relationships, and what's your take on allowing that? And then also, how do you think about influence? Big questions with TikTok today, and your technology could really be used to influence people towards good and bad ends. Yeah, I think there's a pretty good way to operationalize the difference between when you're being manipulated by something that wants you to vote for a person or to buy something versus when you're dealing with an AI that's optimized for your own well-being, and that's what we try to do with our ethical guidelines. So we have this nonprofit, The Hume Initiative, that essentially tries to codify that principle and says, these are the ways

that you can pursue these different applications so as to optimize people's well-being. It even has a bunch of ways that you can measure people's well-being that rely on a combination of what we're able to get through our API, so positive emotions, basically, over time, and also different kinds of self-report measures that we recommend gathering. So as long as the AI is optimized for your satisfaction, for your well-being, I think it's not manipulation. When you get an AI that's optimized for somebody else's objectives and using your emotions for that, then that can be manipulative, I think. And that goes for the romance case as well. If you're dealing with an AI girlfriend and it's ruining your life by, you know, forcing you... you're spending more time with it than you're spending with humans, and that's going to be a negative for you, and it's going to show up in many ways as being negative for your well-being. That would show up in these measures. If it's optimized, though, for your well-being, and you're having a good time with it, and it's healthy, and you're not spending more than X amount of time on it, maybe that's okay, you know, maybe that's... What about trying to upsell me? Like, hey, you're in business class, and would you

like to be in first class? You're in economy, would you like to go up to economy plus? And it uses your technology to be really convincing about the value of that and upsell. How would you look at an upsell? That, to me... ethical or not ethical? I think that's not ethical unless it's done in a very, very careful way. Basically, our guidelines don't allow that, but you could... Guidelines don't allow an upsell? But humans do upsells all the time, right? I think upsells are okay if the goal is to find the person who really will benefit from the upsell and only try to sell it to them, you know, and you measure the effect of the upsell on people's well-being afterward, and you're like, okay, people actually benefited from this, I didn't sell this to somebody and then they regretted it. So I think there's ways of doing it that are going to be fine. The problem is that if you just allow anything, you allow people to optimize this for anything at all, then the potential for manipulation is pretty high. And I think this is true regardless of Hume. I think people are building these things that will be extremely persuasive, and Hume ideally will be providing the AI that responds and protects you. It's like, okay... So you are very much in the camp of, hey, we

have to be really thoughtful about how this technology is deployed. Yeah, but not in as much of a paternalistic way. I think that technology can have a sense of humor, and it's okay if it offends some people, and it doesn't need to be politically correct all the time. But what I care about is, is this good for people? That's it, at the end of the day. And so we have our ways of measuring well-being in order to optimize for that objective and not be paternalistic, basically. Yeah, but I mean, at the end of the day this is so powerful. It will be more powerful than just watching videos on YouTube, because it's customized to an individual. So the Ben Shapiro or Rachel Maddow, pick whichever side of the political spectrum you're on, you know, those people are trying to convince you of their position and interpretation of the world. This to me is even more bespoke and customized to individuals, so if you showed even a little propensity towards some of their viewpoints, it could really... whether it's the language model or the emotion, but the combination of them... You know, the same way people were complaining, like, oh, people go into the intellectual dark web on YouTube. I don't know if you heard about that. You see a Joe Rogan, then you get a Jordan Peterson, you wind up on an Alex Jones, and the next thing

you know, you're like some white supremacist or something, is the claim. But yeah, media does influence people, and it is a stepping stone from one to the next to the next. You may start out with somebody like Sam Harris, just intellectually rigorous, etc., and then all of a sudden you wind up at Alex Jones, is the complaint for many parents. But this would facilitate that, wouldn't it? Like, massively? I think you get that when you optimize for engagement. To some extent, TikTok doesn't have this data, but it's still incredibly good at doing that. I think where this data helps you the most is in taking into account people's user satisfaction, their well-being, their mental health, all those things. So if TikTok took this stuff into account, and it was doing it in a way that was consistent with our guidelines, let's say, then it would be using that data to optimize for people's well-being over time instead of engagement. And so you'd realize that if you throw people down the slippery slope of getting more and more extreme viewpoints, which is what happens today, because they're engaging, and they're offensive at first and you want to argue... if people who go down the slippery slope end up kind of

isolated, and it affects their social relationships, they start to get angry, this is not good for people's well-being. So the technology can look at that at the individual level; at the society level, it can look at the health of all of the people using a technology and say, hey, there's a collective impact of this. So that's the road we want to go down: being able to measure, long term, is this affecting people positively? And you really need expressive behavior to look at that data. Language alone is not going to get you there, basically. Yeah, I mean, you're going to be facing a real uphill battle, because the marketers want this software. Your top customers, I predict, will be marketers who want me to try Zyn or whatever those pouches are that people are putting in there. You know, I'm in Texas right now, and everybody's putting these Zyn pouches in or whatever, and I'm just like, that can't be good for you, and they're like, want to try it? Marketers love this kind of stuff. Maybe the pitch to me is like, you know, hey, it's performance, and it's just nicotine, it's like caffeine, you drink caffeine, you should try this. But for other people it might be, you know, hey, you're the cool kid. So you're going to be in a

really interesting position as a provider of an API. I think a lot of the marketers are going to want to use this to try to convince people to do things where maybe it's unclear if it's actually good for them, like, hey, you should gamble on sports, right? There's marketing going on like crazy, and if I'm a marketer, man, this is for me the Holy Grail. Yeah, I think that we want to connect more with the end user and show the end user that we're optimizing for their interests, and have that be the selling point, rather than connect with the people selling to them. But no, I hear you, I think that's a real concern. But on the flip side, if we just optimized for people to buy things, or let's say we just optimized for engagement, you reach a certain point where it becomes so negative for people that regulators have to step in. And you kind of start to see that with TikTok, for example. Kids are spending six hours a day on TikTok, and if they made it any more addictive than it already is, parents would step in, regulators would step in, they'd be like, this is actually bad for the whole society. So at the end of the day it's not necessarily good for our business. Yeah, which is what's happening with TikTok right now as we speak. I think parents

are getting the message, like, this is too addicting for adults and kids. And the idea that media is not influential is so naive. When people are like, yeah, you know, media doesn't have an impact, it's like, are you sure? All studies show that media, and video specifically, is one of the most convincing mediums of all time, in all human existence. If you want to manipulate somebody, video is the way to go, and then customized video with, you know, that is matched to it, is like 10x that. So you have something here that I think is incredibly powerful, and the fact that you're being thoughtful about it makes me feel great. I think it's awesome that you're taking a measured approach to this. I wish you great success with it. If people want to learn more or try it, how do they get into the developer sandbox and play with this, and who are you looking to work with? Yeah, go to Hume. You can sign