-
Notifications
You must be signed in to change notification settings - Fork 108
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Want to suggest a wake word? Leave your thoughts here. (AIS-1441) #88
Comments
The Willow team and community would love "Hey Willow". It's our domain name because we've been waiting for this. Thank you very much for offering this option, it's very exciting! |
I'm glad you like this. Since "hey" and "hi" sound pretty similar, sometimes people might not really notice the difference. So, I was thinking, maybe we could support both "hey willow" and "hi willow" for waking up the device. That way, whether you say "hey willow" or "hi willow", it'll still work. Of course, when we release the wake word model, we'll call it like "wn9_heywillow". What do you think about that? |
Good idea! My only concern would be overall reduced accuracy (wake reliability vs false wake). We've noticed quite a bit of false wake with Alexa. From what I've read the automated TTS approach has 90-95% the accuracy of the models trained on human samples. I like "two word" wake words because they tend to improve accuracy, I suspect a 100% "Hey Willow" wake word could result in equivalent or even improved accuracy with the TTS approach vs even human sample trained Alexa? Of course we could always test this, even starting with a pure "Hey Willow" model, a pure "Hi Willow" model, and a merged model. Thanks again for offering this! |
Your concern may indeed happen. We will generate two words and test which model performs better. |
"hey/hi willow" model: Test dataset description: |
Guys, what you are doing is really great. We have created a smart speaker called Homai based on the esp32-s3. We trained the model ourselves, but it is resource-intensive and not so easy to integrate into the pipeline. Could you please add support for our word Homai [ho'mai]? Thank you in advance! |
Hi @AigizK , |
Hi @sun-xiangyu |
I'm sorry that our TTS model cannot specify a syllable to extend its pronunciation at the moment. This means that we cannot generate a large number of accurate “homa ai” phrases. |
Hi! Thank you for this awesome solution! We are developing a smart voice assistant called Sophia. Would it be possible to have the wake word "Hi Sophia"? This would help our user experience drastically. Thank you in advance! |
Hi @PrathamG , I'm glad you like it. "Sophia" sounds like a wake word that can be used directly. I mean, maybe we don't need an extra prefix "Hi". I suggest we start with just "Sophia". If the performance is not satisfactory, then we can train another one with "hi Sophia". What do you think? |
Sure, that sounds like a good plan! We can use only "Sophia" and test the performance first. Thank you |
If possible, I also wanted to request the wake word "Little Sophia". We are still unsure about which wake word to use, and having both options will help us determine this via user testing. |
Now our computing resources are limited. This project can generate about two wake word models in a month. So we will choose some popular wake words. Of course, if we have some free time, "Little Sophia" is also fine. |
No worries, totally understandable! Looking forward to testing out the "Sophia" wake word |
"Sophia" model: wn9_sophia_tts FAR(False Alarm Rate): 1 times / 8 hours |
“小美” or “小美同学” would be a perfect choice. It will suit a lot of use case. We all want wake word like a human name. |
@xygh, “小美同学” sounds good. |
Thank you! We will test it out and report the results by next week |
BTW, “你好小美” is also a perfect choice. |
"小当家" or "Hi 小星" is preferable wake word in our scenario. Thanks a lot! |
The second version "Sophia": Perfromace: Improvement: |
Both of these words sound good. If you have no preference, we will choose "hi 小星". |
"小美同学" FAR(False Alarm Rate): 1 times / 8 hours |
Hello! This is a great opportunity I was hoping would come up, I'm so glad this is now possible! I've seen that the wake-words "Mycroft" and "Hey, Mycroft" are very popular in the community, and it is also the name of my product so would very much improve user experience. Would it be possible to have either of these trained and released for the community? Thank you so much in advance for this! |
@lewardo, I'm glad it could help you. Although "Mycroft" is simpler, it seems there are quite a few words that sound similar, so I'll prioritize training with "Hey Mycroft." |
Deployment in the cloud is a bit difficult for us and is not in our plans. I think esp-dl might be a solution, if you want to deploy a model of your own, I recommend you to use it. |
Looks like there is already a wake-up word in English: Hey,Wand. Can we have a Chinese version? e.g. 神奇魔仗 |
We are using Espressif's ESP32S3 chip to create a small wizard dialogue toy that can provide great emotional value and companionship. |
Sounds great, I'm happy to help train a "Hi,小巫" wake word. |
We are working on a patient-side voice assistant for the healthcare space. We desperately need help training the English branded wake word, "Hey, Henry". We are currently testing with the ESP-BOX-S3. Many thanks in advance. |
|
Hi,小巫: wakenet9l_tts2h8_Hi,小巫_3_0.639_0.642 Perfromace: This is the first model trained by TTS V2.0 pipeline. |
Hello! We are working on a duck-shaped interactive installation, deployed in the snow landscape, responding to pedestrians' voice with a glowing light. The project will be open-source; we have completed most of the work and will be publishing details on GitHub and Hackaday quite soon (I will update this comment with the links then). We would really appreciate the addition of the wake word "小鸭小鸭" (xiǎo yā xiǎo yā). We are also willing to help if there is anything we can do to bring it to life before snowfall ^ ^ |
@ayuusweetfish, |
Hello there. We are working on an open-source cosplay prop called RinaChanBoard that supports showing expressions on a led-pixel-based screen. We've implemented an app with functions like BLE control, video-playing, music-with-expressions, voice-control and so on. However, the voice-control part is not flexible enough, so we're planning to migrate to ESP-SR. Videos【天王寺科技】璃奈板的制作过程 Source Codehttps://github.com/Spartan859/RinaChanBoard (This is an archived version. As we've encountered plagiarism for commercial usage, the newest version is currently private. I can provide permissions on your request.) WakeNet ModelsWe would really appreciate if you can train models as followings:
Thanks for all your efforts in publicizing ESP-SR, which provides me hope on making progress on my project. |
@Spartan859 , |
@sun-xiangyu If true, then we would happily accept りなちゃん/LinaJon as the wake word. Otherwise, just use 璃奈板 as the wake word, and we would give up using the Japanese one. We're grateful for your help! |
The performance of sub-phrase is unpredictable, which means it may be difficult to wake up. According to your requirements, it is best to choose 璃奈板 as the wake word. |
@sun-xiangyu |
@ayuusweetfish Perfromace: |
Thank you! It works wonderfully! I have made the project repository public: ayuusweetfish/Yun-Ying-Ya. It is still WIP; I will make sure to post more details over the next few weeks ^ ^ Thank you for your quack help! |
请问下我该去哪里下载这个唤醒词? |
在当前头部提交的这个目录。Registry 上还没有发布新版本,所以我暂时把这里的几个文件放进工程根目录下 In this folder at the current head commit. The new version has not been published on the Registry, so I temporarily placed these files in the |
Yes, as @ayuusweetfish mentioned, you can find the wake word model you want in wakenet_model folder, then overwrite the model you were previously using, and it will be ready to use. |
@Spartan859 Perfromace: |
谢谢你的回复 |
明白了 谢谢 |
请问下您这个唤醒词支持adf里面替换吗,目前有用到adf的唤醒 |
当然可以,adf 也是用esp-sr进行唤醒 |
期待您帮助训练以下唤醒词。 |
“Hi,春风“ 也不错 |
哈哈,让我想起了《剑来》,不错,就是在冬天的时候喊,怪怪的。 |
您好,可否帮忙训练一个叫“小酥肉”的唤醒词?我正在用ESP32S3开发一个面向儿童、学生的语音助手(也支持成人使用),已经接近完成,问了下大家都非常喜欢和期待“小酥肉”这个名称,如果可以使用这个名称,会对提升产品效果有很大的帮助。非常感谢~~ :) |
Although we have implemented some optimizations, children's voices is still a challenge to our current TTS wake word model. |
哦,补充下,不是那种很小的小孩子。一般是小学五六年级和初高中学生,说话连贯性和准确度都类似成人了,我觉得可以用成人的数据。另外,目前调研了一下,也就是用esp-sr的方案最好,用其他方案都会有一些受限于算力和能耗方面的问题。如果可以的话,请帮忙训练一个吧,期待~~ |
Hi all,
We're excited to offer the community more free and high-quality wake word models. Everyone has their own unique wake word preferences. Now, we're ready to regularly release some of the most popular wake words. Please let us know the wake words you want! English and Chinese are both welcome.
In the past, it was an expensive process to collect high-quality human speech data. But now, our team has developed a cost-effective way to train wake word models by using only TTS samples, which reaches 90-95% accuracy compared to models trained by human-recorded samples.
The wake word models and esp-sr have the same license and are free for commercial use. If you want a more accurate and exclusive wake word, please use our wake word customization service.
Currently, we support over 20 wake words. You can choose any one wake word to test. Starting from August 1, 2024, to get a new wake word, you'll need to meet one of these requirements:
We are preparing to upgrade to a new TTS model and generate some wake word models with better performance.
The text was updated successfully, but these errors were encountered: