Synthflow Podcast

Podcast Show Notes: Fireside Chat with Tom Chapland of Canonical AI

Episode Overview:
In this episode, we sit down with Tom Chapland, CEO and co-founder of Canonical AI, to discuss the evolving landscape of voice AI agents. Tom shares insights into how Canonical AI is helping developers and their customers gain visibility into the performance of voice AI agents, unlocking growth and building trust. We also dive into Tom’s journey from agriculture technology to voice AI, the challenges of building voice AI systems, and the future of the industry.

Chapters:
Key Takeaways:
  • Visibility is Key: Canonical AI helps developers and their customers gain visibility into voice AI agent performance, building trust and unlocking growth.
  • Voice AI is Still Emerging: The voice AI space is rapidly evolving, with many opportunities to improve and refine voice AI agents.
  • Integration Made Easy: Canonical AI offers easy integration with platforms like Synthflow and Make, making it simple to start analyzing call data.
  • Future Features: Canonical AI is working on advanced features like self-healing systems and more granular analytics to further improve voice AI agents.
Call to Action:
If you enjoyed this episode, make sure to subscribe to our podcast and leave a review! For more insights and tutorials on voice AI, visit our Synthflow Academy.

Creators & Guests

Host
Tom Osman 🐦‍⬛
Growth @synthflowai

What is Synthflow Podcast?

Welcome to the Synthflow Podcast, your go-to resource for exploring the latest advancements in voice AI.

Join us as we dive into insightful discussions with industry experts, share real-world applications of AI-powered voice assistants, and uncover how cutting-edge technology is transforming communication, productivity, and customer experience.

Whether you’re a developer, entrepreneur, or tech enthusiast, this podcast will keep you at the forefront of the voice AI revolution.

this conversation I speak with Tom Chapland of Canonical AI

now

this conversation was initially held inside of the Synthflow Academy

and was a fireside chat

so towards the end it did involve a bit of a screenshot

so if you get lost towards the end

I wanna see the outcome of that screen

showing what chemical is all about from a UI perspective

please head over to our YouTube channel and watch it there

but hope you enjoyed this conversation with Tom

okay I guess we can we just get a rock and rolling

if anyone's filtering after the fact

we can so thanks everyone for joining

I'm joined today by Tom Chaplin who's building canonical AI uh

which is a mixed part of the voice AI agents

you missed that intro before

so I guess kick things off

how you doing Tom and you wanna give us the quick one line intro

who you are and what is canonical

canonical they say

yeah you did great

you said it right uh

yeah I'm I'm Tom Chaplin

I'm the CEO and co founder of canonical

and and we give you visibility into what your voice AI agent is doing

we also give your customers who you're building voice AI agents for

visibility into what your voice AI agent is doing

and what we're finding in building this product is that um

there's some hesitation around the voice AI

your voice AI customers to expand their usage of the bogus

AIs that you build them um

but once you give them more visibility into what's happening

and they can see that the agent is performing as expected

that's what really unlocks growth

and our mission is to help voice AI developers unlock

the growth in voice AI um

by by helping their customers

feel more comfortable with putting their brand in their sales funnel

on the line with the voice AI agent

you're building them

yeah I like that

um it does seem a lot of the time it's a bit of a black box

and people can say the voice

I do certain things but can you back it up with the actual data

so this this course successful

did it go through did it completely gone etcetera um

you've been doing this for like a year

it's kinda a year and a half August 2,023 if this is correct

so you've been in the space for a little bit

is this is this always been the same product

no actually we started with something else

so I'm gonna go a little bit further back in describing my background

um I started an agriculture technology company back in 2014

and that company got into y Combinator um

it was a great company we

we were building a

sensor that help farmers figure out how much to irigate um

is based on my PhD work and is really neat data feed is

is a good company um

and in the process of building that company

I became more acquainted with nondeterministic

neural network type models

um it was uh

when like the Tensor Flow API came out

we're building cool computer vision stuff on top of our data feed

um and really liked it

and eventually I found a good home for that company

that company was called Tule and was acquired by Crop X

they're great people um

I want to start up journey again

and of course um

my co founder and I who was also the CTO of my last company um

we want to work in LLMS and initially we're building a cache in there

the idea of a cache in there is it's like a smart database

every time a query came in

um from a user asking an LLM something

we'd look in that smart database to see if

essentially the same question was answered

asked before if it was

we could return the answer from the database

rather than calling the slow

lumbering LLM um

and that LED us to reach out to a bunch of voice

AI developers to see if they'd be interested

this was about you know

that year and a half Mark ago

um

and what we found was like yeah

latency was a problem but well

actually I kind of skip to step there

the the thing I really wanna emphasize here is like

we just fell in love with the voice face

we started meeting these voice

AI developers and seeing all the cool stuff they're building

and just seeing the incredible potential for voice AI

that everyone here in this

in the session with us believes and feels

and as examples of seeing it um

and we just fell in love with the voice AI space

and we realized the cash wasn't the right thing to build

it was too slow our growth was too slow on it

I should say um

but what our friends and voice guy kept telling us was like hey

like I've got these contracts with these big potential clients

but they're only dipping their toe in the water

and I need a way to make them more comfortable with my voice AI

and I don't really blame them for not being comfortable with it

because I'm like manually listening to a few calls

I don't have any sort of visibility into what's happening

can you build something that makes it easy to show people like

here's what's happening in your data

and everything's going as expected um

so I started building that about um

I mean we shipped the dashboard in early October um

and it's been it's been really fun

it's been great ever since um

I just love being a part of the Voice Act community

and really glad to be here and

and hopefully get to meet some of you that have joined and

and get to make more friends in the space yeah

that's cool was there one demo when uh

you were kinda exploring options where you thought like

holy crap this is the space that I need to like apply my timer to

ah yeah

like what was the moment where I was like

oh voice AI is so cool

I think for me a lot of it is um

like when I first logged into Sinflow

and it must be similar for so many people

like you first log in and build your first agent as like

it's just a toy and it calls you and you're like

oh my God I can talk to it and it just

I feel like I still build

like voice AI agents just for fun on the side

like little little toy demos just like for fun

I built one that was like a baseball game for my son um

I feel like I'm an artist with a palette

in this really interesting palette that I get to work with

with voice AI and I just think it's such a neat space to be in

and also I see a lot of calls that um

people make um

voice agents make and they just

it's amazing to see how well they can work

you know like when they just nail it

the customer skeptical and they're like

let me talk to a representative

and then the voice guy just solves their problem

at the end they're thanking the machine

it's just like magical it's clearly the future yeah

some of the times when you listen back to a call um

so if you're testing an agent and I've done it a lot of times

you're tweaking a prompt

and suddenly you hear like a recording of a conversation

and you're like damn

that was actually really good

so you're tweaking things like sentence lengths and like do say this

then say that and when when you get that conversational flow right

yes yes

my music to his yeah what was that for you Tom dude

like what was the moment where you like oh

voice the eyes the space I need to be in

um so I've always been have like a

had a community going for a few years called Shiny Object Social Club

where it's just a bunch of us hanging out in a discord

and playing around with any new tech

that's interesting and then uh

I think 11 lads came along when it was like synthetic voices

and you could actually just get into say stuff

and then there's a bit of avatars coming around

and then that got applied to phone calls with things like sin flow

and battery at the time

and uh I wasn't extreme an extremely technical person

so sin flow is obviously the

that the natural fit being a more no code solution and the

the feeling it just gave

that I could now have this power to create these really powerful uh

workflows and tools and functionality was just yeah

it was just wild and then things like um

connecting it with other alternations

which we're gonna have a look at in a minute

where you can not only extract information from the phone call

but also have it trigger other workflows and suddenly

and make internal calls and outbound calls and hooking together

it's just just felt like everything is now on the table

yeah yeah

absolutely yeah

it's cool so yeah and

and it is is been kinda missing for a bit what we seen with chemical

and do you wanna give us the rundown

we're gonna do a screenshot too so you can do that whenever you want

um so basically this base if I'm right

this gives you a really easy way to see

how your agents are performing

that's correct right

yes that's the idea

gives you and your customers an easy way to see what your voice

AI assistant is doing and for you

it helps you improve your agent and for your

your client it helps you feel comfortable with what you're doing

so that they'll start pushing more call volume through it

hmm got it and have you seen have you seen that the take of it

is this meant for um

like small regencies s and B's escaping to enterprise

where where does where does it fit in

enterprises are always the slowest right

I think that um they

they clearly aren't going to um

use voice AI

until they have a lot of comfort with the fact that

the voice AI assistant is doing what you expect

so they're definitely gonna need people selling to enterprises

building voicing for enterprises

and enterprises are gonna need something like what we're building

but it's not really my my initial focus

just because it's so slow to get going with enterprises um

our customer base really is two different categories um

it's vertical SAS specialized vertical SAS

voice AI companies are focused on something like

building a voice AI assistant for um

HVAC businesses um

or they're the AI agencies um

and that's part of why I'm excited to be on this on this

um in the session with you is like

I think Sinflo does a great job with AI agencies and fitting

the needs of the AI agencies

um like with the white label product and that sort of stuff

and the AI agencies

you know the other type of our customer

and we find with both of the those two types of customers

the vertical SAS and our AI agencies

they kept coming to us saying hey

can this is cool

you built for us and it's helping us to hone our agent and refine it

but can you also build this

can you also expose it to our customers

um can you give them a login or can you

we embed what you built into our website so that um

they can see how the what

what's going on with the data

um so that's kind of the direction we're going with things now um

and making progress on that as well

I'm sorry that's kind of maybe a rambling answer

does that that's good

yeah that's really good

yeah I've got a bunch with a bunch of Ad Sheerans and head

so you getting the full white label route or is it more like semi

semi embettable sort of scenario

yeah so let me describe the current state of the world

and then also where we're going

um the current state of the world is you as the voice AI developer can

uh send us calls

um you send us recordings of the calls and a transcript

I mean go through later how to do the integration with make

and also the integration just with code

I'll points that as well um

then you can log in and see all the calls

for the different agents that you send us

in addition you can set up

you can work through with us to set up a login for your client

and your client can only see their agents

so we set it up in such a way that your client only sees

you know the agents are supposed to see

um

that's the current state of the world

then the next thing we're building is embedable components

so if you have a dashboard

like you're using the white label solution for Vincent flow

um Vincent flow

you can take

like the call flow diagram that I share with everyone shortly um

and you can bed that in the

the dashboard so you can show your client that um

as well and then finally um

this is in the works

but I'm talking to the the cofounders of Sinflow about um

doing a more native integration

that just makes this whole thing easier for

for you guys

and that there's not really the step where you have to mess around

they'll just be one click and you can you can get this

this stuff right there and sing flow

got it so I guess you able to trigger different kind of queries

depending on the outcome of the call

like in real time

or is it more of like a uploading batch and then analyze after say

like a campaign

it's a little bit both

so let me let me describe like the the main wait

so we can we can share the screen at any time

so whenever whenever you wanna add context

so you took your own we can do that I'll first describe it um

and then I'll show it and I'll summarize it again

so the the what I wanna describe is the first thing we show you is um

visualizations of what your calls are doing

so most of your calls will likely go down this happy path that

you designed um

but some calls will go off that happy path and take a turn

and those are the ones that you wanna look at

and they'll help you as the voice AI developer to improve the

the agent um

so and that requires that you log in and look at the data

and this is also what um the voice AI developers

customers also like they like being able to look in

and kind of see the different paths that the calls are going down

and get that kind of warm

trusting feeling that they can believe in this

this product and expand with it um

the other thing we do is we have uh

push notifications to slack around insights

so like right after a call ends

if it was supposed to be successful

if it was most likely to be successful

given the fact that of how long it was and what path

what call stages the call went to

but it wasn't successful we push those to you um

and make it easy for you to look at those

and see if you need to do any damage control or um

you know

you wanna make any changes to your prompts or your your integrations

um and then the last part is audio metric

so we we have metrics like um

the latency the number of interruptions and that sort of stuff

and that's really more for um

helping you surface different types of issues and dig

dig deeper into how the agents performing and also like

give reassurance to your clients like look

most of the time my latency is

you know this many milliseconds and that's within the

the human what's normal for human conversation

and things are working as expected

got it I guess

that kinda gives you the opportunity to be a bit proactive as well

so if you're already seeing

like a higher threshold of either foul calls

or ones going off the happy path

or highlight and see

then you could kind of proactively get in touch with your clients just

just to let you know

this is happening and we're on it and we're looking at fix

yeah so my last company

we helped farmers make irrigation decisions

and I think initially my first instincts

especially as a former academic that built this technology

and so close to me was like

if something went wrong I would like

I didn't want to go get the farmer's attention and bother them

and be like hey

this went wrong um

it just didn't feel right to me at the time

but then later I realized that

the key to our success was being very transparent with our customers

so we were always talking to our customers when we saw something good

and we wanted them to like notice how well this thing went

and also when something went bad

we reached out to them because it's better they hear it from us

and that they know that we're working on it

that's what prevents churn um

than if ended on their own

it's all about you know

maintaining and building that trust with your clients

and um

what what is the kind of state of monitoring and analytics tools

especially when it comes to third party

and so we do stuff in 10 minutes since low and

but I haven't checked in too recently on all the available tools

uh they

what what's the existing looking at right now

I think well

I think it

what the ecosystem looks like is indicative of the state of voice AI

right now voice AI is this emerging field

and there's some people getting a lot of traction in it

but not um

most people are still kind of in the early stages of it

so if you look at what other tools are out there for helping you

like third party tools for helping you develop your

your and understand how your assistant is doing there

mostly on the testing side

so the side before you deploy your agent into production

um it's for like

running simulations of lots of different scenarios

and making sure that um

your your agent is trying to automate that

testing that all of you are familiar with doing um

the the problem with that approach is like once you test it

it's great you can feel very confident in it

but there's no way to really simulate human behavior

like humans are the most unpredictable force in nature

and once your agent out into the production

and if you have agents in production

you know exactly what I'm talking about

like people do weird stuff um

and you're you're wouldn't expect what would happen there

and you're going to

really need to see what your agent is doing with real humans

to figure out how to improve it and um yeah

how to improve it so the

in the pre production side

there's companies out there like Vocera and Coval

um in the post production

there's us which is more of like this um

low code way of doing these things

um that

it's really amenable to working with AI agencies and your n customers

um or you can use more higher code solutions

like you can set up a whole

like data dog pipeline and build your own visualizations

uh and

and data flows through with data dog

but and

and the the problem with that is it's not really centered around voice

No. 1 it takes a lot of technical skills

and second there's a lot of problems with

voice that are different from just looking at like

traces from you know

text based lolms um

so I really think this area is kind of Mason and emerging

and we're seeing things emerge like what we're building

got it what's been

like the most common thing that people have found

after implanting your tool that they weren't aware of before

oh interesting

um

well I

think I once heard I was listening to a podcast recently of um

by C Partners and they're talking about what builds a great

what makes a great sass company

uh that really can capture a lot of the market like

like a gusto is that isn't that they do one thing really well

it's that they do a lot of little things

um and they do all of those myriad little things well

um and when I look at voice AI assistance

I can always kind of tell when I'm looking at the data

our customer sent us is like how

how much care and time the developer has put into developing

how much time it's been in production um

because there's always little things along the way that

it's just lots and lots of smooth

at rough edges that need to be smooth so

um finally I'll give examples

it's like a notorious one is um

I should actually more generalize first

there's all these workarounds you have to build

given the current state of the voice AI technology

with text to speech and speak to speech

text and lolms um

even with the audio only models and if you

you're not careful about how to do those workarounds

like how to get email addresses or um

how to be careful about like repeating people's names um

then you're gonna run into problems

um and it's really that sort of constant

careful hearing and craftsmanship that makes a great

voice AI agent I gave two examples there by mostly smoke high level

let me know if you want me to dig in more

no that's good no

I like it um

I think still very few people

I think really digging into digging into analytics

cause there's not too many doing a lot at scale

but presume that's gonna ramp up

now that they're getting more accurate

and they're getting easier to create and deploy so

so I expect that to increase soon

yeah so I'm looking at the chat

and I'm seeing that people are eager to see screen shares

and they're eager to figure out how to connect

especially Nick

I think Nick's very very keen to see a demo of screenshots

let's dive in and we can keep on

sharing my screen um

let's see

give me just a moment and any questions you go file them into

file them into Q&A we can

we can bring them up on screen

or you can even come up and ask yourself whichever

whichever you fancy

okay cool

you can sit if you wanna follow along

I'm gonna drop this in the um in the chat um

so here's the sequence of what I mean go through

I'm first gonna fairly quickly walk you through

our platform and what we're doing

and then from there

I wanna spend more time focusing on how you can integrate with

your sin flow agent with our platform

we have a free tier so it's it's easy to get started

I just put into the chat

also our blog post that really walks you through um

how to integrate with make or with code um

and that's what we're gonna cover now so so without further ado

let me go back to our dashboard

I'm on our demo account now um

and I when you first log in

you see the summary page

the summary page will show you how many calls you've uploaded

and the total duration of the calls you've uploaded

and some high level stats um

where things get interesting is when you scroll down

and you click on one of the agents that you've uploaded

and then we can dig in and look at that

that agent um

so I'm gonna scroll down here

and first gonna talk about how we process the data

you send us for each agent you initially send us excuse me

I should say it differently when you connect a new agent to us

we take the first 15 calls that you send us

and we use those first 15 calls to figure out the stages of the call

what I mean by stages is

an agent will typically go through stages like reading appointment

scheduling um

you know answering a question about a product uh

and then closing the call we figure out what those stages are

then once we have those stages every time a new call comes in

then

we can take each turn in the conversation and assign it to the stage

so when the user says hi

we can assign that to greeting when the user says

I wanna change my appointment

we can change um we can assign that to appointment scheduling

what this enables us to do is to determine

a path that each call goes down um

that all path is helpful in and of itself

but it gets really helpful when you can take that call path

and look at the call paths of all your calls for a given day or week

and really get the bigger picture of what your assistant is doing

um we do that automatically

you don't have to submit a core structure or anything

you can just get that from the course

well I'm really glad you asked that question

that was about to say is that we do it automatically

because some people have so many agents

they don't wanna go in there and edit each one

but also if I were logged into my personal account right now um

you could see a little edit button here

and you can edit the stages to better match

what you think they should be

got it okay

make sense

we do the same thing with the call outcomes

so we use the first 15 calls

and some LOL

magic to figure out what the typical outcomes are of the call

appointment not scheduled

appointment scheduled or product um

inquiry handled

then we use those outcomes

lolms are just like people in that they're more likely be accurate

if you break things down into small steps for them um

they're more likely to be able to solve a problem

when the problem is broken into small steps um

so we take those outcomes and then in another step we ask the LLM

was this a successful call given the outcome or not a successful call

um again

you can edit the outcome names in our dashboard

you can't see that here but if you log in um

you can see that uh

can you still see my screen by the way

uh I just opened your cash

okay

click on the sign up button here

um you can sign up for free and you get uh

I forget how many how many free minutes do you get

you get 4 hours of calls per month for free um

so you can click that sign up button um

and once you sign up you can click the upload calls button

we have a gooey for getting started

so you don't even have to integrate using make or um uh

code initially

you can just download calls from Sinflow and drop them in here

and that'll bring you back to what I was showing you earlier

where you have the uh

will bring you back to our dashboard

so this is it's really easy to get started and and try it out

um you just go to uh

I'll drop this in the the chat

you just go here and click sign up um

and you can try it out um

and upload calls

okay so now that we've uploaded calls I wanted to um

show you some of the things you get so this is the thing that my

the Voice AI developers

customers seem to like the most it's this call flow map um

I think it just gives people reassurance that they can see like oh

the voice the eye is doing what it's expecting

like initially greets the people and then after greeting them

some amount of calls go to objective not met

like yeah people sign out

um people hang up um

that's pretty normal um people hang up right away

but even that's something you can optimize as a voice

AI developer and work on different tricks um

but then the calls are getting you know from their partition

some are going to appointment scheduling

some are going to product enquiry handling

if we go to appointment scheduling

we can see that some go straight from appointment

scheduling to closing call

we can click on either a node

and see all the calls that went through appointment scheduling

or an edge between the nodes

and that'll bring out

a drawer where you can see the calls that went through this um

stage of um

greeting to appointment scheduling and what I'm looking at here

is a call where um it started with greeting

you can see the the stage and who was speaking is the assistant

it's a greeting then the user says

I'd like to schedule appointment

that turn that conversation turn was assigned to appointment

the appointment scheduling stage

and just gives you an easy way to kind of

look at your call volume and see like oh

I see what's happening um

everything's going as expected or something's not going as expected

this is gonna be an easy way for me to click in

and look in and see what's happening

and see what I need to change um hmm

can we pause in there yeah

that was that was really good

that was a really good walk through

so when let's say somebody's looking at this

this report

and they see there's something which has gone off the happy path

and they wanna dig in

how would you kinda recommend someone like evaluate these calls

just clicking in and read through the

read through the transcripts

and you can kinda tell where they've gone off

off path

yeah actually for that

so this first screen is really something that's like

for your quick view of understanding what calls doing

if you want to figure out how to improve your call

I actually like the call map more um

maybe that's a bias because I'm a little uh

it was the first thing we built in it

I think it's neat but um

with the call map

it's a similar thing where we're showing you the stages of the call

but what's different about the call map is we show you

will highlight in red

where most of the calls are stopping with failure

um

it makes it easier to subset to just the calls that are going poorly

the other thing you can do with the call map and

and the demo accounts not as good as an example

but you can see

you'll see something like most of your calls like 106 calls here

47 calls here 17 this is the happy path that most calls are going down

but you might find like this weird branch

and it's when you click on the branches that

that go off in an unexpected direction that you find the insights

you click on those and it will um

and if you click on the transcript part

it'll bring you to the part of the conversation

where it's going off the the rails

it will highlight in yellow

and it will give you ideas about what you need to improve

about your assistant um

we haven't

one of the things we want to do is like

start using diffusion models to turn this into like

a like a comic book panel

so it's quicker to read because reading transcripts is slow

but people can ingest that else of quicker

but we haven't built that yet yeah

we we have like a

the Ellen

Musk judge to just like evaluates the call that summarizes it

but again that's still uh

like a bit of a pain to read all this summaries

especially be doing any sort of significant cool volumes

this might be a bit like a out there question

presuming might be possible and I don't like can I guess features

etcetera but you can almost kind of extrapolate similar

this out to self healing and self recommendations through

call improvements

and maybe if somebody ships like an incomplete assistant

and maybe could use another couple of stages added on to it

you could you could probably even tell by

and having a decent bunch of calls running through your system yeah

that's exactly where we're going with this

so with I'm gonna pop over back to my dashboard

um it was a pretty scheduled comment below because I logged in

it was this thing

um I can't show it

but we have a feature where you can integrate um

with Slack

and what we're doing at Slack in sizes were showing you like

here's a call that went down a happy path

90% of your calls to go through all these stages

the happy path they complete and have a successful outcome

but here's a call that wasn't successful

but it went down the happy path

look at it um

and where we wanna go from there is to then

take those calls that should have been successful

because we've we figured out how to subset just to the calls that um

should have been uh

that are problematic rather than like manually sampling

um and we want to suggest improvements to people system prompts

um to

to help them like you know

figure out like oh

well these calls are going poorly because when the voice

AI repeats their name back

spells their name back it speaks way too fast

you gotta like prompt it to speak slower

um things like that

that we think we we wanna get to the vision that you describe Tom

where we can help kind of build

self healing systems with the human in the loop

hmm

yeah I'm wondering that

you can really pull some really good insights out of this

and instead having a data served up this way

like a visual for any decent sized company

you have to report on the effectiveness of these things

that's gonna be a really useful page to show them

yep

yeah um

so another thing I'd like to show is um

the custom metric so what a lot of people

cause is in line with what you're just talking about Tom

um you can ask custom metrics of call

so the summary is great um

but summaries

one line summaries LLM is a judge

they don't capture kind of what happens within a call

often they all think loss is over

this is a common problem I say Isaac Gloss is over

kind of the details that might happen in particular turns of the call

um and one of the ways to approach that issue is to like

look at call maps and it'll help you identify

the different stages that you need to focus on

and subset rather than random sampling

um another thing you can do is you can define custom metrics

like you may be interested in knowing um

your your voice

I should be asking in the US for consent to record the call um

and you might wanna ask like

did the assistant ask for consent to record the call

and will then for every call that comes in

will ask that question and will surface

you can click on the times it's false um

and we'll show you just the calls where that was false

you can look and be like why did the voice say I not ask for consent

you can see what the issue was and try to figure out how to improve it

um so just one more way to uh

improve your agent and it's also something that like

your customers might be interested in too

because they're like well

how can I get my voice AI assistant to sell better

um you know

how to handle this objection

did the voice AI handle the objection or something like that

yeah nice

there's a there's a couple of like

interesting quirks sometimes come up in assistant calls

one of them is sudden uh

sudden bigger bigger lacency gaps in between questions

so say if you have a script

everything's gonna read nicely

and then suddenly for some reason a question comes up

the user responds as normal

but then there's a big lag and then the user has to repeat themselves

and then maybe the assistant repeats themselves

and then it kinda gets through

is there a way to identify something like that in in your system

yeah so you're this is like

it's almost when I see this happen the most is around interruptions

where like uh

the the person the voice say

I will think the person has finished talking

it'll start talking and then the person will like wait for them

and it just like

the question ends up getting repeated and things get confusing

is that when it's like type of scenario you're talking about

or maybe darkness some sometimes it can be like a

like a rogue background sound which cuts the cuts the response out

but there is uh now and again

I don't know if this happens when you just uh

maybe listening to voice articles all day

so you're more attuned to it and then an old person won't notice

but sometimes you do find that now and again

there comes like a little bug where there's like a

nine out of the 10 questions that the assistant will use on the call

be absolutely perfect like normally it's expected

but then one of them for some random reason

uh just has like a

a double or triple the length of gap and it'd be good

this is maybe like a more minor

like fine tuning use case rather than a high level general one

but it would be good to be able to identify against photos

and I I sit when I'm doing recordings since doing videos like

cause I'm looking at the audio waves between the responses

so I can see the legacy gap on each response

sometimes they're most of the time they're even

but then you can see one

which is like double the length of the other one

uh yeah

I know what you're talking about so

um we calculate on the back end

we're calculating the latency

the for the human response and the latency for the AI response

and we're calculating

how much of the time the person is silent

when it's their turn to speak

which is indicative of confusion

and all these other audio metrics

we haven't surfaced them yet

we just have a um

there when you log in um

this is something we're shipping

hopefully today

is something where you can just download all those metrics

for your calls so you can see the latency for all your calls

but eventually

we wanna surface that and put it in what we call a rain cloud plot

where right now this is the call duration distribution

and you can see like most of the calls this frequency history

it's a frequency um distribution

um you can see most of the calls are around here

around 30 seconds um

and then the individual calls are these dots underneath um

and we wanna create a drop down here for latency

so you'd be able to see like

oh most of my calls have pretty low latency

but you could click on the ones with higher latency

and see what was going on with those

and see when they happen are they happening because of interruptions

or is there some way you can

might be able to improve the flow so that those

those late and sees are less likely happen

or like

there's a long wait and see and maybe you wanna go tell your client

like hey

this thing happened um

that's uh

but we're working on it it sometimes happens

it's okay most of the time your late and sees are over here

you know there's nothing to worry about

yeah sometimes it can be random things like uh

response length or sensitive length

or if they're doing some sort of rag over a knowledge base

which has like an awkward setup

then there's just like a bubbling legacy there too

if they're making like uh

it's easier custom action

which is like a laggy API and then they get a response back in time

um yeah

having that flag would be

would be actually awesome

especially at scale that'd be

that'd be killer

yeah I mean

what you just said speaks to the craftsmanship of building voice AI

is like you can uh

I think um

you know you can get something up and running in

in five minutes and that will get you 80% away

but the next 15% is gonna take you a lot longer and a lot of skill um

you know it's faster to get there if you have a lot of skill

but it takes you know

really understanding how these systems work and how to like

avoid these latency traps and

and other problems they can come up if that

you know if you're not designing your system well

and carefully monitoring and analyzing it

hundred percent there's a if anybody's got a new system flow

we can just cover what we're talking about a bit more is

if you're using something like custom actions

or if you have your system connected to Knowledge Base

if the user asked the question which isn't in the prompt

then it can go and look up through a document or a series of documents

Wikipedia and then bring that um

information back into the call

which may take slightly longer than a pre determined script

or another one

which you can do in sin flows and called a custom action

which is basically just like an API call

which you can either trigger to happen before the call happens

so uh

actually an example I like to use is if you wanna get the weather

so if you're doing anything with like an outside venue for instance

you wanna know the weather to determine what you recommend to the

end customer or if it's like a car service or like a concert

whatever it is

then you can bring that weather forecast into the conversation

you could probably do that before the call starts

then you avoid the mid call conversation

but if you're the mid call later see um

the way moment to cover that if you're doing it mid call

this could be something like

looking up an order ID

in a Shopify store to track where the shipment is

for instance that could be the customer gives the ID

you can look up the database to see the states of the package

and to cover that API call going through to to shopify

you will set in your prompts or in your field in sin flow

you can trigger it and say okay

I'm just gonna look up your information now

and while the this is the same

that sentence is making me a guy called the same time

so that then covers that look up hopefully that makes sense

that was a that was a great intro to um

knowledge bases and rag and tool calling uh

all all the right there very specific

thanks not being at the most advanced custom action developer

we've got you killers and simply does some amazing stuff

but that's as simple as I ha ha I can understand anyway

um story time how to integrate

yeah that's right

no that's it

I was just gonna get good and I'll be I'll be awesome okay

so one of the links I dropped in the chat was um uh

to this blog post that tells you how to connect your sin flow agent

using make um

and the place really just started it

it walks you through all the steps um

uh

but you can also download the scenario

like if you go into this blog post and scroll down

you'll find the scenario where you can just download it

and upload it into make so just backing up a moment um

make is a platform for um doing server side automations um

so it's like instead of writing the server code you can use make to um

take the the the data from a call and do something with it um

like you know send an email with it or

or whatever and we're using

you can use make also to send data to our platform um

so you log into make and the first step is to create a custom web hook

and this walks you through creating that custom web hook

and then you would then go into sin flow and go to your

your agent that you want to um uh

send the data from that the call data from that agent to our platform

and you click on deployment and rest API

and you would paste in that web hook that you got for make um

then the next step is I mean

the easiest thing is to just download the

the scenario from from our website right here

and then you upload that into um

into make and then the final changes they

all you need to do to finish the integration is you end up going into

here I'll go into my make and you have to um

it'll look like this once you upload your

your file into your scenario to make

and you have to put your API key in for canonical

and then it's it's ready to go

I mean I I blaze through that really fast

but it's that's just the bigger idea

you might be wondering where you get your API key to get your API key

you go to our website and you click on the little icon and there's um

the setup button and you can get your API key from in there

um and I don't know what maybe I went through that too fast time

should I slow down on part of that or does that makes

does that look good I think

I think make make integrations are probably a bit more like on the

on the niche technical side anyway

and it probably won't be one of the like beginner use cases for

for using make I think probably a lot of people are still at the

either just trigger a call by a make with some flow

or retrieving the details of the call by some society

to actually produce some more education materials

in this case maybe I can fire up the tutorial

yeah and our tutorial actually you might find it

people here might find it as a good way to get started with me

cause it's like a

walks you through an example that you can get up and running um

and and kind of acquaint yourself with the mix system um

we all in the chat is a monitor synth play voice yeah

and of course cool I'll let my drop this in the Academy too okay

yeah that'd be great um

and you can also we

I also linked to code on how to programmatically to our docks

excuse me so if you're if you're writing server code um

and you you want to integrate with

with our code and send data via the send the calls via the API

that's all in here

there's a tab for sin flow for hitting our sin flow and point um

so that's that's there

for people that are kind of interested in integrating with code

how granulate can you get with the make inspiration

can you

can you animate screen comprehensive when she passing Jason through

can you I guess then you can filter filter by um

if uh

there's like an unintended call which goes through depending on how

you labelling it or a failed call

and you could trigger a slight message too would that be right

yeah

one way if you pick up those dynamically created uh

call stages from the retrieval

so one way to do it I'm gonna start with

the way that kind of better maps to my way of seeing the world first

which is like just send all your calls

use this make um

scenario here to send all of your calls to our platform

and then use our platform to figure out the calls that you want to um

be alerted about and you want to surface

so you can set up the Slack integration for that um

you could create a more sophisticated

the other path is to create a more sophisticated make workflow

where you're using some of the information that you can get from um

the the end of the call

the events um

they get sent to the web the make web hook

and using that to decide what to send to our platform

and what to do other things with it

you know

um along the way I think that's what you're getting at right Tom

yeah yeah

pretty much nice

and one question I didn't wanna ask is when

I guess this is gonna start becoming important

when now everybody knows you can measure these calls

and we have the Synthro analytics

which are just Surf's level now we have this

which set them more in depth

what would we be able to at some point we like create a okay

downloadable report with like a snapshot of the

the last the months worth of course

to be able to send it to send it to a client or is this

this is the one wait

hold on

let me get back you guys should just share this with the client right

yeah

so that's that's what I was about to say is that you can give a login

we can set you up so that your client can login

and only see their calls here

and that way they get

you know the interactive experience rather than the static report um

and then as I mentioned earlier

we're working on I'm hoping that we'll have a native integration with

with Sinflow and that will be these sorts of charts

in the other analytics products

I've shown you along the way

like this stuff that that'll all natively be in

in your white label account um

and then like along the way if people want

you know

something that turns us into a PDF

because they have some old school customers that want pdfs

we can probably make that happen

just FAX it to them and let us all yeah

yeah nice

that's cool uh

so then you got any questions of what you've seen

any alternation folks

and maybe directed someone like Scott

who's very heavy on all the alternation side and like a high level

is this something you could work into your stack for clients

and then if there's any leftfield questions

where you voice AI in general

or questions to Tom about plans far

far away and then we can spend a few minutes answering them

yeah so while we wait for any questions come through

is there anything else that's interesting

um that you're looking at Invoice AI that's kind of separate to this

it could be

um pure racing new models

it could be languages it could be latency related things

what what else is this space you interested in

um you know

it's always kind of tempting to talk about like the

the latest demos that have come out that showcase the

the cutting edge abilities of voice AI and multimodal AI

um and a lot of ways I think that's a little bit

it's just it's not as um

it's almost taking our eye off the ball

the what's really happening in the world right now is we're in

um

we're used to this idea that you can do amazing things with lolms

but some of you might have noticed this during the holidays

that like the Thanksgiving holiday

or you'll notice it at the upcoming holidays

most people that you interact with

maybe they have a chatty BT account and all they've said to it is

hello they're like

we're all living in the future

and we don't realize how much opportunity there is just to

bring people that aren't in the future that we're living in right now

up to where we are now and there's so much money and opportunity in

helping the world go from where it is now

just to the current state of technology

um with these interactive voice assistance

um and uh

I think part of the reason we don't realize is

I'm constantly talking to people like you Tom

or the people in this in this chat

um are in the session that like are living in that future already

but I think that we I think what's really exciting is just like

the opportunity that's here in front of us with with Sinflow

yeah true

you do you kinda get

uh wrapped up in everything that moves so fast

and that curve has been so violent that you do realise that everyday

semi news coming out but still people are still catching up

so that's a that's a great point

we have a question through through

net does it

does it email a report

say once a week or notify errors as they happen for real time alerts

so we have the current notifications we have are the Slack um

Slack alerts um

and uh

that's they'll tell you when a call was

a long call that typically finishes in this success

but failed or if it was a call that was on the happy path

that normally finishes in success

but fails um

we haven't built uh

any other sort of like email reporting or other types of alerts now

um I'd love to hear what you'd like to have built Nick

um you know

it's a fun stage of the company we're at uh

where it's it's just my co founder and me at the stage

and we're working closely with our customers on getting feedback and

you know we love getting to hear and build for things

build things for people

I wonder if that be like a

a good way I think it's pretty pleasant info too

just saying if like a certain amount of cool volume

then kind of falls outside of the norm

in terms of computer core ratio or something

that she just pulls the assistant

um trying to get like that feedback you

uh yeah

that would work really nicely for an outbound agent

right when you have a little bit control over um

you know you know

it's okay

it would be okay to turn it off for a little bit to make some changes

yeah run

just pumping out loads pumping out loads of calls

and realize that 5,000 of them have gone crazy

yeah yeah haha

um chain just asked about booking a call with us

I just dropped my calendar in there for anyone that like to meet me

and also I'd love to connect with people on LinkedIn

I just like seeing people build right

cool demos on LinkedIn of what they're building

and it's like LinkedIn it went from being like this boring thing

I hated being on Twitter when I was running my egg type company um

so like it's kind of a trap for me now

I spend way too much time on LinkedIn and Twitter

just because I get to see like

the coolest stuff that people are building

so I'd love if people um

oops that's the wrong link uh

would connect with me on LinkedIn

Twitter so I can stay up to speed on what they're building

amazing

and uh

any other thoughts just from

you can like wrap up after this if anyone's got any more questions

any more thoughts than just general like AI applications

uh is there any

no pretty much time for side projects

but is there any

anything like hobby related that you're playing around with

it could be video generation images

language translators language learning

know that with my kids

could be spending a lot of time with stuff like Synthesis

Tutor for math which is going really well

so any of those types scenarios you earned

I'm sorry I missed that

are you asking me or are you asking the community

should I answer at first I say to you

I'm the Zike community I see if anyone's got any good ones

um

yeah so one of the things I'm really interested in now is

how do you get emails from voice AI

they they

they bubble it all the time

and I have I have this API that um

I just shipped yesterday but still uh

last night actually but I haven't built the testing around it

but it's basically a way to accurately get an email from a caller um

and what we're doing with it is

we're just doing a lot of data processing to um

extract the the

the proper noun the spoken letters um

and hand that back I think um

I don't know I find it to be a really neat problem

it sort of goes back to this idea that I think

just voices and neat space with so many neat problems

and like even this one's kind of a mundane problem

like extracting an email from a call

but like it's just been really fun to work on um

I'm excited for getting to the point where I can start demoing it um

and hopefully maybe even integrate it with something like sin Flow

that would be useful

because that is a current paying point that requires a lot of um

creative prompting

but attention to detail and it's if you don't explain it well

then a lot of people do slip up

especially when they wanna capture customers

emails and then they come through wrong

so yeah plus 1 for that

yeah yeah

that's neat and the other stuff I just saw um

a post by um Quinn and daily

he's a friend of mine and he use Gemini's multimodal to um

basically I think show us what the

the future is gonna look like

where you're talking to a computer

and the computer is able to see what's on your screen

and it's going to respond to you in text and voice and video um

and I think as humans we

we find it easiest to speak um

but we ingests information with our ears at a slower rate

listening than we do for visually

we're like visual creatures

so I just think it

we're gonna have this future relatively soon around the corner

where people are talking to machines

and machines are talking and back

as well as showing us information back um

and I I think that's a I think that's really neat

I mean I I start talking to my computer too

with Whisper Flow and have really enjoyed yeah

I use Whisper Flow all day every day

yeah

basically doubles the amount that doubles your speed of like output

so any text box is there you can just reply twice as fast

so like Slack replies email replies like DMS yeah

it's incredible

yeah so you're mostly using it for email and Slack

is that the main thing you're using for

um

sometimes I do it for uh like notes

so I wanna take a quick note

I use that um pretty much anywhere actually

sometimes I will use it uh

this weekend or just gone

I was using it in a bolt dot new by stack blitz

and also lovable and it was just building

so while I was building I was doing something else and it comes back

I'll prompt it again with with just my voice and then just go off

so yeah I actually have to type

so you're building software just with your voice

now what what platforms are you doing that with

so one is called bolt.new B O l t.net

oh yeah yeah

uh huh yeah

by staclets and the other one is called lovable L O v a B l e dot gov

I haven't heard of lovable yeah

I've been hearing lots of actually

yeah and another one if anyone's interested in uh

it's the one I keep you interested in NYC

yeah there's another one that she which she may be more interested in

she's open source

works with Llama 3.3 centimetery called Cerebros Coda

um I can share the link in the chat for everybody still here

this will build things in under 1 second which is kinda really good

yeah the whole space is moving very fast and we got lots

lots of stuff to play with

so yeah voice AI is a super cool space to be in right now

I came to said final comment

uh agree on the problem one letter missing an email is a complete void

so we focus only on the right phone number

and then do post after post alternation

getting emails right

in a new flow we always

focus on getting the phone right during the call

caller ID and confirmation by voice

yeah

yeah

nice

nice well

I think it's pretty good place anything

anything to add Tom anything you wanna at the end

no I just wanna thank everyone for taking the time to

to listen to me and hear

listen to my opinions about what's happening in voice and

and give

give me the opportunity to tell them about what we're building uh

really means a lot to me thank you

of course I appreciate you taking the time to share with us

and maybe soon we'll see an inspiration inside the synth life as well

yeah that'd be great looking forward to it

hopefully we can figure that out nice

thanks Tom thanks everybody and see you back inside the community

so you posted straight after the call

the replay and posting up on YouTube too if you wanna see

thanks folks catch you on the next one