Learning to build voice user interface is a great skill to have as devices like the Amazon Echo and Google Home are gaining popularity. Amazon is currently running an incentive program this July (2018) for people developing Alexa Skills. You develop a new skill and get a limited-edition Alexa developer T-shirt! If more than 100 customers enable and use your skill in the first 30 days of publication, you could also receive an Echo Dot.
Sounds pretty cool right? Apart from the T-shirt or Echo Dot, you’ll also learn developing something for voice assistant. If this seems like a good opportunity, read ahead!
WHAT IS ALEXA AND HOW DOES IT WORK?
Alexa is a cloud-based voice service that powers millions of voice experiences in the home. Devices Alexa powers includes Amazon Echo, Echo Dot, Amazon Tap and Amazon Fire TV.
What is a “skill”? They are basically apps(programs) that extend Echo’s functionality and features. There are lot of amazing skills already being developed by Alexa developers from over the world. Check them out here. They can answer questions, play trivia games, play music, set alarms, tell jokes and many more. Here is how it works:
- User speaks something
- The skill(defined on the Amazon Developer Site) matches the question(keywords) with an Intent(what the user is trying to ask) and then sends this Intent to an endpoint(an API call to the server like AWS Lambda )
- Lambda or your own server is where the data is stored. It has code to take the Intent and return the relevant response, taken from the data.
- Echo speaks back this response to the user.
WHAT DO YOU NEED TO START?
- An Amazon Developer account, to create a skill.
- An Amazon AWS account, to create a Lambda function.
- An idea! The skill you create is an app, and just like any other app in the market your app should have a good use case.
CREATING THE SKILL
I’ll explain how to create a fact skill in this blog post, you can dive into deeper to create some awesome skills. At the end, there is a list of amazing tutorials for you to learn more.
Go to Amazon Developer Account and sign in, and select Alexa.
On the Alexa Page, on the top-right, hover over the “Your Alexa Consoles” and click Skills.
Click on the “Create Skill” button.
Fill out the information,
- Skill Name: Number Fact
- Skill Model: Custom
- Skill Language: English(U.S.)
Click, Create Skill and you’ll see a page similar to this.
Now click Invocation in the left hand navigation panel, and set the Invocation Name. Invocation Name is what your user will need to say to start the skill. As the skill is number facts, setting the Invocation Name to: Number Facts, Click on Save Model and move to JSON Editor for the Interaction Model.
This is now the core of the skill, Here we define,
- an Intent Schema
- Custom Slot Types
- Sample Utterances
INTENT SCHEMA
These are the “intents” of the Skill, which define the “intent” or purpose of something the user says to the Skill. So we may have a “GetNewFactIntent” intent, which signifies that the user wants to know an fact. They are defined in this format:
{
"intents": [
{
"name": "AMAZON.CancelIntent"
},
{
"name": "AMAZON.HelpIntent"
},
{
"name": "AMAZON.StopIntent"
},
{
"name": "AMAZON.FallbackIntent"
},
{
"name": "GetNewFactIntent"
}
]
}
The AMAZON.HelpIntent is a default Amazon intent that you can use to handle any “Help” type questions. Likewise, AMAZON.StopIntent is a inbuilt intent to handle “stop” intents.
CUSTOM SLOT TYPES
We don’t actually need any Custom Slot Types for this basic Skill. These are used when you have a lists of related values — like colours, or makes of cars or countries. You can read more about them here.
SAMPLE UTTERANCES
Here we give examples of “utterances” — what the user may say for each intent. Each utterance goes on its own line and starts with the intent it is associated with e.g.
a fact
For each intent, we list the maximum possible ways that users may say it:
“a fact”,
“a number fact”,
“tell me a fact”,
“tell me a number fact”,
“give me a fact”,
“give me a number fact”,
“give me some number information”
We put all this in the JSON Editor and hit Save Model, and then Build Model. Your JSON should look similar to this
You’ll see a message like:
then…
Now, it’s time to set the endpoint, Click on “Endpoint” button in the left navigation bar. This where we specify where the Intent information goes when the user asks a question. The options are:
- AWS Lambda — a Lambda endpoint
- HTTPS — your own endpoint
We’ll be using AWS Lambda here.
AWS LAMBDA
If you haven’t used AWS Lambda before, it’s a service that allows you to host code with its own endpoint, without a server. More info here. Heard of Serverless apps?
Go to Amazon Web Services and sign in the portal, and go to Lambda service.
Once on Lambda home page, click on the orange button to create the Lambda function.
Choose “Author from Scratch”, and then name your function in camelCase, for me, numberFactFunction.
Choose, RunTime as Node.js 8.10, and in Role dropdown, choose, Create a Custom Role. This will open a new window, in that click on Allow. Read more about IAM Roles here
Then on the back on Lambda Page, select lambda_basic_execution in the Existing Role, and click “Create Function”.
Now, we need to add Trigger, for we are developing Alexa Skill Kit, click on Alexa Skill Kit in the left navigation.
In Configure trigger, copy the Skill ID from the Developer Console, and paste it here, and click on Add. Skill ID will be in the form amzn1.ask.skill…… After this is done, click on the numberFactFunction where we added the trigger, this will show the Function Code at the bottom.
Add the index.js code from here, also add the AlexaSkill.js from the same link in the function code by creating a new file.
File AlexaSkill.js has specific event handlers. It specifies the output, prompt and speech. The Index.js file lets you customize meet the needs of your Skill. Click on Save.
From the right-top of the page, copy the ARN value, it’s of the form arn:aws:lambda:region:function:nameOfFunction. Go to Amazon Developer Console, Select your skill and click on Endpoint in the left navigation bar. Select AWS Lambda ARN and paste the ARN code copied in the default region input box. Click on Save Endpoints. And you’ll see a popup similar to the below image saying Skill Manifest Saved Successfully.
In the top navigation bar, click on Test tab. If the test is disabled, enable it and either speak or type the Open Invocation Name , for this skill, “Open Number Facts”, and watch Alexa speaking the fact. 😀
Enter the distribution and Certification details as required, and submit the skill for certification. It takes a few days to hear back from the Amazon Developer Team. If your Skill is approved then everything works and all of the information is agreeing to their standards. Your Skill will be certified and published and available for others to use. If not, you will receive feedback and suggestions on what you need to to to resolve any issues so you can re-submit.
If your skill is unique across the alexa skills present, you’ll also receive the limited edition T-Shirt.
Hope you learned on how to build skills, if you develop more you’ll have more familiarity with JavaScript and Node.js. It’s very different creating something that is voice activated by an end user as compared to reading on a screen. Hoping to see more Alexa developers, feel free to contact in case there is any issue developing a skill.
Submit your skill details for review, at the link given below, to be eligible for the program:
https://developer.amazon.com/alexa-skills-kit/alexa-developer-skill-promotion-india
Good luck! 😀
– Mishal Shah