diff --git a/404/index.html b/404/index.html index b52b87a5..f209c2bc 100644 --- a/404/index.html +++ b/404/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/Code-Mixing-Metrics/index.html b/Code-Mixing-Metrics/index.html index fd208f4a..2cd07e6c 100644 --- a/Code-Mixing-Metrics/index.html +++ b/Code-Mixing-Metrics/index.html @@ -146,7 +146,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -171,7 +171,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -186,7 +186,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -196,7 +196,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/Code-Mixing-Seminar/index.html b/Code-Mixing-Seminar/index.html index aad9aec1..4f20943f 100644 --- a/Code-Mixing-Seminar/index.html +++ b/Code-Mixing-Seminar/index.html @@ -146,7 +146,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -171,7 +171,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -186,7 +186,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -196,7 +196,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/Turn_Taking_Dynamics_in_Voice_Bots/index.html b/Turn_Taking_Dynamics_in_Voice_Bots/index.html index fdd88382..61a6038b 100644 --- a/Turn_Taking_Dynamics_in_Voice_Bots/index.html +++ b/Turn_Taking_Dynamics_in_Voice_Bots/index.html @@ -144,7 +144,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -169,7 +169,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -184,7 +184,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -194,7 +194,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/about/index.html b/about/index.html index ab683d74..8b1915ee 100644 --- a/about/index.html +++ b/about/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/authentication-in-grpc/index.html b/authentication-in-grpc/index.html index e64fc68d..5e53e468 100644 --- a/authentication-in-grpc/index.html +++ b/authentication-in-grpc/index.html @@ -146,7 +146,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -171,7 +171,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -186,7 +186,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -196,7 +196,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/authors-list/index.html b/authors-list/index.html index e18412cd..8831d3af 100644 --- a/authors-list/index.html +++ b/authors-list/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/authors/Shahid/index.html b/authors/Shahid/index.html index ae7856fd..bae5fd80 100644 --- a/authors/Shahid/index.html +++ b/authors/Shahid/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/authors/Shangeth/index.html b/authors/Shangeth/index.html index 5f5daba1..ea800ff7 100644 --- a/authors/Shangeth/index.html +++ b/authors/Shangeth/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/authors/Shashank/index.html b/authors/Shashank/index.html index f5f5192e..b9d9ebba 100644 --- a/authors/Shashank/index.html +++ b/authors/Shashank/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/authors/anirudhdagar/index.html b/authors/anirudhdagar/index.html index 1f84eb5d..fe6ac86e 100644 --- a/authors/anirudhdagar/index.html +++ b/authors/anirudhdagar/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/authors/deepankar/index.html b/authors/deepankar/index.html index be5b5963..c3cdb015 100644 --- a/authors/deepankar/index.html +++ b/authors/deepankar/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/authors/greed2411/index.html b/authors/greed2411/index.html index 8da9607d..8dc132d6 100644 --- a/authors/greed2411/index.html +++ b/authors/greed2411/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/authors/janaab11/index.html b/authors/janaab11/index.html index 50ba7005..7836a039 100644 --- a/authors/janaab11/index.html +++ b/authors/janaab11/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/authors/kritianandan/index.html b/authors/kritianandan/index.html index 03ece0d0..ca1b38e0 100644 --- a/authors/kritianandan/index.html +++ b/authors/kritianandan/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/authors/lepisma/index.html b/authors/lepisma/index.html index f680fa4a..c70d521d 100644 --- a/authors/lepisma/index.html +++ b/authors/lepisma/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/authors/mithun/index.html b/authors/mithun/index.html index dc518a1e..f4799a57 100644 --- a/authors/mithun/index.html +++ b/authors/mithun/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/authors/ojus1/index.html b/authors/ojus1/index.html index 24cab835..b8871ad9 100644 --- a/authors/ojus1/index.html +++ b/authors/ojus1/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/authors/prabhsimran/index.html b/authors/prabhsimran/index.html index 3cbb9e70..10e19c3e 100644 --- a/authors/prabhsimran/index.html +++ b/authors/prabhsimran/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/authors/sanchit-ahuja/index.html b/authors/sanchit-ahuja/index.html index 364033b7..87fd4343 100644 --- a/authors/sanchit-ahuja/index.html +++ b/authors/sanchit-ahuja/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/authors/shantanu28sharma/index.html b/authors/shantanu28sharma/index.html index 580a5bff..03410d2c 100644 --- a/authors/shantanu28sharma/index.html +++ b/authors/shantanu28sharma/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/authors/shikharmn/index.html b/authors/shikharmn/index.html index a220da30..5485a73d 100644 --- a/authors/shikharmn/index.html +++ b/authors/shikharmn/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/authors/swarajdalmia/index.html b/authors/swarajdalmia/index.html index a7cc3ad6..30f2b754 100644 --- a/authors/swarajdalmia/index.html +++ b/authors/swarajdalmia/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/bad-audio-detection/index.html b/bad-audio-detection/index.html index 6bf550b1..fda7e791 100644 --- a/bad-audio-detection/index.html +++ b/bad-audio-detection/index.html @@ -146,7 +146,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -171,7 +171,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -186,7 +186,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -196,7 +196,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/buy-me-a-coffee/index.html b/buy-me-a-coffee/index.html index 1f174944..f7753edf 100644 --- a/buy-me-a-coffee/index.html +++ b/buy-me-a-coffee/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/careers/index.html b/careers/index.html index 35a3ec0b..0b3ca411 100644 --- a/careers/index.html +++ b/careers/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/categories/index.html b/categories/index.html index d9008adb..12f16369 100644 --- a/categories/index.html +++ b/categories/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/complexity-of-conversations/index.html b/complexity-of-conversations/index.html index 5bec0de0..46110153 100644 --- a/complexity-of-conversations/index.html +++ b/complexity-of-conversations/index.html @@ -146,7 +146,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -171,7 +171,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -186,7 +186,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -196,7 +196,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/confidence-calibration/index.html b/confidence-calibration/index.html index f97f38dd..1ed2c452 100644 --- a/confidence-calibration/index.html +++ b/confidence-calibration/index.html @@ -144,7 +144,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -169,7 +169,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -184,7 +184,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -194,7 +194,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/contact/index.html b/contact/index.html index 6a4ab8f4..abe1e688 100644 --- a/contact/index.html +++ b/contact/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/contextual-slu/index.html b/contextual-slu/index.html index 857e71c7..2dcb95a1 100644 --- a/contextual-slu/index.html +++ b/contextual-slu/index.html @@ -144,7 +144,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -169,7 +169,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -184,7 +184,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -194,7 +194,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/emnlp/index.html b/emnlp/index.html index be624444..56713b07 100644 --- a/emnlp/index.html +++ b/emnlp/index.html @@ -146,7 +146,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -171,7 +171,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -186,7 +186,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -196,7 +196,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/end-of-utterance-detection/index.html b/end-of-utterance-detection/index.html index a0169c5d..34944c28 100644 --- a/end-of-utterance-detection/index.html +++ b/end-of-utterance-detection/index.html @@ -146,7 +146,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -171,7 +171,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -186,7 +186,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -196,7 +196,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/engineering/index.html b/engineering/index.html index c0db39c4..199a97a6 100644 --- a/engineering/index.html +++ b/engineering/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/evaluating-an-asr-in-a-spoken-dialogue-system/index.html b/evaluating-an-asr-in-a-spoken-dialogue-system/index.html index 172ed9af..c39a5fc9 100644 --- a/evaluating-an-asr-in-a-spoken-dialogue-system/index.html +++ b/evaluating-an-asr-in-a-spoken-dialogue-system/index.html @@ -146,7 +146,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -171,7 +171,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -186,7 +186,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -196,7 +196,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/explore/emotional-tts/index.html b/explore/emotional-tts/index.html index a9c83a32..309c0d99 100644 --- a/explore/emotional-tts/index.html +++ b/explore/emotional-tts/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", @@ -507,6 +507,8 @@

Emotional TTS

+

This is a work from 2021

+

Using our rich Emotional TTS system, we deliver the right tonality and superior customer experience in our dialog systems. This page showcases a sample of emotional presets and variations from our synthesizer.

diff --git a/explore/index.html b/explore/index.html index 06ea1434..379dd812 100644 --- a/explore/index.html +++ b/explore/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/explore/natural-tts/index.html b/explore/natural-tts/index.html index 71349067..a62ccd32 100644 --- a/explore/natural-tts/index.html +++ b/explore/natural-tts/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", @@ -507,6 +507,8 @@

Natural TTS

+

This is a work from 2021

+

Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS.


diff --git a/explore/speaker-entrainment/index.html b/explore/speaker-entrainment/index.html index 5ca73960..cb6d58f1 100644 --- a/explore/speaker-entrainment/index.html +++ b/explore/speaker-entrainment/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", @@ -507,6 +507,8 @@

Speaker Entrainment

+

This is a work from 2021

+

Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor.

This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate.

diff --git a/explore/voice-cloning/index.html b/explore/voice-cloning/index.html index 4c516e3b..88ba1419 100644 --- a/explore/voice-cloning/index.html +++ b/explore/voice-cloning/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", @@ -507,6 +507,8 @@

Voice Cloning

+

This is a work from 2021

+

Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below.

diff --git a/fast-microservices-with-grpc/index.html b/fast-microservices-with-grpc/index.html index 270b78bc..599a3486 100644 --- a/fast-microservices-with-grpc/index.html +++ b/fast-microservices-with-grpc/index.html @@ -146,7 +146,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -171,7 +171,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -186,7 +186,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -196,7 +196,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/feature-disentanglement1/index.html b/feature-disentanglement1/index.html index 1f6d6e88..c2e3b992 100644 --- a/feature-disentanglement1/index.html +++ b/feature-disentanglement1/index.html @@ -144,7 +144,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -169,7 +169,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -184,7 +184,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -194,7 +194,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/feed.xml b/feed.xml index 39d06a01..6feec17c 100644 --- a/feed.xml +++ b/feed.xml @@ -1,4 +1,4 @@ -Jekyll2024-05-09T19:47:06+00:00/feed.xmlSkit TechSpeech Technology from SkitSpeech LLMs for Conversations2024-05-09T00:00:00+00:002024-05-09T00:00:00+00:00/speech-conversational-llms<p>With LLMs making conversational systems has become easier. You no longer need to +Jekyll2024-05-09T19:50:34+00:00/feed.xmlSkit TechSpeech Technology from SkitSpeech LLMs for Conversations2024-05-09T00:00:00+00:002024-05-09T00:00:00+00:00/speech-conversational-llms<p>With LLMs making conversational systems has become easier. You no longer need to focus on the low-level details of categorizing semantics and designing responses. Instead, you can concentrate on controlling high-level behaviors via an LLM. This is the trend that we see most of the world moving towards as diff --git a/gsoc-2022/index.html b/gsoc-2022/index.html index ee163432..0f276896 100644 --- a/gsoc-2022/index.html +++ b/gsoc-2022/index.html @@ -144,7 +144,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -169,7 +169,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -184,7 +184,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -194,7 +194,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/index.html b/index.html index 76a4b08b..cdcb8f11 100644 --- a/index.html +++ b/index.html @@ -142,7 +142,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -167,7 +167,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -182,7 +182,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -192,7 +192,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/interspeech/index.html b/interspeech/index.html index 81533d40..9bd461c6 100644 --- a/interspeech/index.html +++ b/interspeech/index.html @@ -146,7 +146,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -171,7 +171,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -186,7 +186,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -196,7 +196,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/label-noise-intro/index.html b/label-noise-intro/index.html index 37f5c9dd..23ccf8c8 100644 --- a/label-noise-intro/index.html +++ b/label-noise-intro/index.html @@ -146,7 +146,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -171,7 +171,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -186,7 +186,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -196,7 +196,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/ml/index.html b/ml/index.html index 3157e566..2ae6d907 100644 --- a/ml/index.html +++ b/ml/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/new-blog/index.html b/new-blog/index.html index 9b51eb49..455a9324 100644 --- a/new-blog/index.html +++ b/new-blog/index.html @@ -144,7 +144,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -169,7 +169,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -184,7 +184,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -194,7 +194,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/normalizing-flows-part-2/index.html b/normalizing-flows-part-2/index.html index 72ecdf6f..65ea2ab8 100644 --- a/normalizing-flows-part-2/index.html +++ b/normalizing-flows-part-2/index.html @@ -146,7 +146,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -171,7 +171,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -186,7 +186,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -196,7 +196,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/normalizing-flows/index.html b/normalizing-flows/index.html index aaad7dc8..3c59253d 100644 --- a/normalizing-flows/index.html +++ b/normalizing-flows/index.html @@ -146,7 +146,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -171,7 +171,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -186,7 +186,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -196,7 +196,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/on-using-asr-alternatives-for-a-better-slu/index.html b/on-using-asr-alternatives-for-a-better-slu/index.html index c723c1b1..69b731d1 100644 --- a/on-using-asr-alternatives-for-a-better-slu/index.html +++ b/on-using-asr-alternatives-for-a-better-slu/index.html @@ -146,7 +146,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -171,7 +171,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -186,7 +186,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -196,7 +196,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/page2/index.html b/page2/index.html index 23ee385d..7dcf42af 100644 --- a/page2/index.html +++ b/page2/index.html @@ -143,7 +143,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -168,7 +168,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -183,7 +183,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -193,7 +193,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/page3/index.html b/page3/index.html index 22abed4b..50c504bd 100644 --- a/page3/index.html +++ b/page3/index.html @@ -142,7 +142,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -167,7 +167,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -182,7 +182,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -192,7 +192,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/privacy-policy/index.html b/privacy-policy/index.html index 0c1ab595..c8b50b23 100644 --- a/privacy-policy/index.html +++ b/privacy-policy/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/reading-sessions/index.html b/reading-sessions/index.html index 8e23e1d5..bf37cc4c 100644 --- a/reading-sessions/index.html +++ b/reading-sessions/index.html @@ -146,7 +146,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -171,7 +171,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -186,7 +186,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -196,7 +196,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/repl-conversations/index.html b/repl-conversations/index.html index b812adbe..8bcb4709 100644 --- a/repl-conversations/index.html +++ b/repl-conversations/index.html @@ -146,7 +146,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -171,7 +171,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -186,7 +186,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -196,7 +196,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/resources/index.html b/resources/index.html index 7ed967f9..5bac0930 100644 --- a/resources/index.html +++ b/resources/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/speaker-diarization/index.html b/speaker-diarization/index.html index eb981461..7dff4254 100644 --- a/speaker-diarization/index.html +++ b/speaker-diarization/index.html @@ -146,7 +146,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -171,7 +171,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -186,7 +186,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -196,7 +196,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/speaker-entrainment/index.html b/speaker-entrainment/index.html index e3bc0c94..3c8eeaf0 100644 --- a/speaker-entrainment/index.html +++ b/speaker-entrainment/index.html @@ -146,7 +146,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -171,7 +171,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -186,7 +186,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -196,7 +196,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/speech-conversational-llms/index.html b/speech-conversational-llms/index.html index 39ad3577..a366ed5a 100644 --- a/speech-conversational-llms/index.html +++ b/speech-conversational-llms/index.html @@ -144,7 +144,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -169,7 +169,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -184,7 +184,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -194,7 +194,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/speech-first-conversational-ai-revisited/index.html b/speech-first-conversational-ai-revisited/index.html index 4f0b48c6..36f55608 100644 --- a/speech-first-conversational-ai-revisited/index.html +++ b/speech-first-conversational-ai-revisited/index.html @@ -144,7 +144,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -169,7 +169,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -184,7 +184,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -194,7 +194,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/speech-first-conversational-ai/index.html b/speech-first-conversational-ai/index.html index 0e12c04a..f0451a1a 100644 --- a/speech-first-conversational-ai/index.html +++ b/speech-first-conversational-ai/index.html @@ -144,7 +144,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -169,7 +169,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -184,7 +184,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -194,7 +194,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/tags/index.html b/tags/index.html index bdcdcd4e..5a4f03ca 100644 --- a/tags/index.html +++ b/tags/index.html @@ -141,7 +141,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -166,7 +166,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -181,7 +181,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -191,7 +191,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/theory-of-mind/index.html b/theory-of-mind/index.html index 3dd82b83..00197238 100644 --- a/theory-of-mind/index.html +++ b/theory-of-mind/index.html @@ -144,7 +144,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -169,7 +169,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -184,7 +184,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -194,7 +194,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/whats-new-kaldi-serve-10/index.html b/whats-new-kaldi-serve-10/index.html index 35322932..23a6d1b7 100644 --- a/whats-new-kaldi-serve-10/index.html +++ b/whats-new-kaldi-serve-10/index.html @@ -146,7 +146,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -171,7 +171,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -186,7 +186,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -196,7 +196,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/", diff --git a/woc/index.html b/woc/index.html index e338f977..0cf4c5ab 100644 --- a/woc/index.html +++ b/woc/index.html @@ -143,7 +143,7 @@ "id": 7, "url": "/explore/emotional-tts/", "title": "Emotional TTS", - "body": "Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " + "body": "This is a work from 2021 Using our rich Emotional TTS system, we deliver the right tonality and superiorcustomer experience in our dialog systems. This page showcases a sample ofemotional presets and variations from our synthesizer. Reference Audios: These are audio files from the training data. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Your browser does not support the audio element. Synthesized Audios: These are audios synthesized from Skit’s Emotional TTS system. Audio 1 : The swan dive was far short of perfect. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 2 : The beauty of the view stunned the young boy. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 3 : Two blue fish swam in the tank. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 4 : Her purse was full of useless trash. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. Audio 5 : The colt reared and threw the tall rider. Neutral Happy Sad Angry Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Excited Apologetic Fear Surprise Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Your browser does not support the audio element. Calm Your browser does not support the audio element. " }, { "id": 8, "url": "/engineering/", @@ -168,7 +168,7 @@ "id": 12, "url": "/explore/natural-tts/", "title": "Natural TTS", - "body": "Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Our TTS has the state of the art results in Naturalness. Below, we present some example audio’s generated by our TTS as contrasted against Google-TTS. Conversational Tonality: Our TTS has a very distinct conversational tonality that differentiates it from other TTS vendors in the industry. Example Audio’s: Text 1 : The whole family gathered around the computer waiting for my sister to come home. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Hi, what can i do for you today ? Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : You will be surprised to know what he did yesterday ! Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : Can you please pass me the spoon ?. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filled Pauses: Filled pauses are articulations like “umm”, “uh” etc which occur very commonly in human conversations. Out TTS supports filled pauses and generates very natural sounding audios. Example Audio’s: Text 1 : Uh, i agree with you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Um, can you just give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I can, uh, yes, i think i can finish this. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Filler Words: Filler words are amongst the most commonly used words in any language. Example words in english are “okay”, “alright”, “you know” etc. These don’t necessarily add any content to the utterance, but have different functional, paralinguistic importance. These often also a specific intonation that are reflective of filler words which is supported by our TTS. Example Audio’s: Text 1 : Okay. I can do that for you. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : Alright. that sounds good. Please give me a second. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : Okay, so, like have you heard of the new company that he started. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 4 : You know, I never thought this would happen. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Ellipses: Ellipses are signified by 3 dots(“…”) and are used often as markers of hesitation, thinking or breaks in conversational flows. Example Audio’s: Text 1 : Can you just . . . hold for a minute Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : I am unsure what to choose . . . i think i will go for this one. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : I am going to . . . play tennis today. Skit Google Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 13, "url": "/privacy-policy/", @@ -183,7 +183,7 @@ "id": 15, "url": "/explore/speaker-entrainment/", "title": "Speaker Entrainment", - "body": "Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Speaker entrainment is a phenomenon observed in human-human conversations where one interlocutor attunes their speech’s acoustic, lexical and semantic features to the other interlocutor. This project aims to create a bot which can entrain on the acoustic features of user’s speech. Incorporating such behavior into bots is known to increase trust, naturalness and likeability, which is likely to increase customer satisfaction and call resolution rate. Baseline Module: The following audio samples are generated from the Baseline entrainment module, which entrains over pitch (fundamental frequency), intensity (loudness) and rate of articulation. Demo Audio Samples: Script-1: Entraining over pitch (fundamental frequency) in this audio sample, entrained performs better. In this script, the pitch of the user is rising and the bot attunes itself to that. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-2: Entraining over rate of articulation in this audio sample, entrained performs better. An excerpt from a user-bot interaction is provided here, where in the entrained version, our bot increases its rate of articulation according to the user. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. Script-3: Entraining over intensity in this audio sample, the non-entrained performs better. In this script excerpt, the pitch rises but that is a result of the user being angry since the bot does not understand him, among other factors. The bot becoming louder in response is very detrimental to call quality, which is why the entrained bot performs worse in this case. Not Entrained Entrained Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 16, "url": "/tags/", @@ -193,7 +193,7 @@ "id": 17, "url": "/explore/voice-cloning/", "title": "Voice Cloning", - "body": "Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " + "body": "This is a work from 2021 Presenting some results of the Beta Feature of Voice Cloning that we currently support in our TTS. We trained our model on 30 mins of recordings from a new speaker. Some examples of both the target and generated audio are added below. Example Audios: Text 1 : It is of the first importance that the letter used should be fine in form. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 2 : The general solidity of a page is much to be sought for. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. Text 3 : The two yards were adjoining, that for the common side much the largest. Target Generated Your browser does not support the audio element. Your browser does not support the audio element. " }, { "id": 18, "url": "/authors/anirudhdagar/",