Android Speech to Text using Google Api
Description:
Android Speech to Text Google Api – hello guys and welcome to another android studio tutorial, in this tutorial article I’m going to show you how to use android speech to text Google API. so basically as you can see on the below image here our activity will have one image view button and one edit text so whenever a user clicks the microphone little icon or a button it will open a white dialog box in which users should start talking to the microphone and then the Google API will recognize those words or sentences and after that, it will set those words into our edit text.
I showed you a few screenshots so you could understand how it works.
Creating Android Speech to text Application:
 To create new project in the android studio simply click the new project button as you can see in the below figure
Then select the empty activity and click on the next button
Then set Application name in my case I select SpeechToText and select java language if you working on java programming and click on the finish button
Android Speech To Text Button:
Make a simple button in your favorite editor in my case I used photoshop for creating a simple mic button and save it into png form
Now copy the designed mic button to your project drawable folder
Android Speech to text XML Code:
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 |
<?xml version="1.0" encoding="utf-8"?> <androidx.constraintlayout.widget.ConstraintLayout xmlns:android="http://schemas.android.com/apk/res/android" xmlns:app="http://schemas.android.com/apk/res-auto" xmlns:tools="http://schemas.android.com/tools" android:layout_width="match_parent" android:layout_height="match_parent" tools:context=".MainActivity"> <EditText android:id="@+id/txtField" android:layout_width="310dp" android:layout_height="39dp" android:ems="10" app:layout_constraintBottom_toBottomOf="parent" app:layout_constraintEnd_toEndOf="parent" app:layout_constraintHorizontal_bias="0.497" app:layout_constraintStart_toStartOf="parent" app:layout_constraintTop_toTopOf="parent" app:layout_constraintVertical_bias="0.326" /> <ImageButton android:id="@+id/BtnMic" android:layout_width="wrap_content" android:layout_height="wrap_content" android:backgroundTint="@color/white" app:layout_constraintBottom_toBottomOf="parent" app:layout_constraintEnd_toEndOf="parent" app:layout_constraintStart_toStartOf="parent" app:layout_constraintTop_toBottomOf="@+id/txtField" app:layout_constraintVertical_bias="0.36" app:srcCompat="@drawable/microphone" /> </androidx.constraintlayout.widget.ConstraintLayout> |
so as you can see here I used one image view button and one edit text and I am going to use the onClickListener  event on this button to listen to the user’s words using android speech-to-text Google API
Android Speech to text Java Code:
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 |
package com.example.speechtotext; import android.content.Intent; import android.os.Bundle; import android.speech.RecognizerIntent; import android.view.View; import android.widget.EditText; import android.widget.ImageView; import androidx.annotation.Nullable; import androidx.appcompat.app.AppCompatActivity; import java.util.ArrayList; public class MainActivity extends AppCompatActivity { ImageView Btnmic; EditText txtField; private static final int RecognizerResult=1; @Override protected void onCreate(Bundle savedInstanceState) { super.onCreate(savedInstanceState); setContentView(R.layout.activity_main); Btnmic=findViewById(R.id.BtnMic); txtField=findViewById(R.id.txtField); Btnmic.setOnClickListener(new View.OnClickListener() { @Override public void onClick(View view) { Intent speechIntent = new Intent(RecognizerIntent.ACTION_RECOGNIZE_SPEECH); speechIntent.putExtra(RecognizerIntent.EXTRA_LANGUAGE_MODEL,RecognizerIntent.LANGUAGE_MODEL_FREE_FORM); speechIntent.putExtra(RecognizerIntent.EXTRA_PROMPT,"Speech to Text"); startActivityForResult(speechIntent,RecognizerResult); } }); } @Override protected void onActivityResult(int requestCode, int resultCode, @Nullable Intent data) { if(requestCode==RecognizerResult && resultCode== RESULT_OK) { ArrayList<String> match = data.getStringArrayListExtra(RecognizerIntent.EXTRA_RESULTS); txtField.setText(match.get(0).toString()); } super.onActivityResult(requestCode, resultCode, data); } } |
Code Explanation:
1 2 3 |
ImageView Btnmic; EditText txtField; |
First, I create those two objects for our two UI elements first is ImageView I name it Btnmic and for edittext I name it txtField
1 |
private static final int RecognizerResult=1; |
then I create a global variable of recognizer and assign a value
1 2 |
Btnmic=findViewById(R.id.BtnMic); txtField=findViewById(R.id.txtField); |
in onCreate method, I find the id for those two elements BtnMic is the id of the button and txtField is the id of edit tex field
1 2 3 4 5 6 7 8 9 10 |
Btnmic.setOnClickListener(new View.OnClickListener() {    @Override    public void onClick(View view) {        Intent speechIntent = new Intent(RecognizerIntent.ACTION_RECOGNIZE_SPEECH);        speechIntent.putExtra(RecognizerIntent.EXTRA_LANGUAGE_MODEL,RecognizerIntent.LANGUAGE_MODEL_FREE_FORM);        speechIntent.putExtra(RecognizerIntent.EXTRA_PROMPT,"Speech to Text");        startActivityForResult(speechIntent,RecognizerResult);    } }); |
Then I create onclicklistener for the imageview element and pass onclicklistener as a parameter and in our onclick method we are going to use intent in order to use the android speech-to-text google API so in the intent, I pass RecognizerIntent and I used
ACTION_RECOGNIZE_SPEECHÂ
to connect the API. next I used speechIntent. putExtra method in the intent so the first parameter is RecognizerIntent.EXTRA_LANGUAGE_MODEL  and the second parameter is
RecognizerIntent.LANGUAGE_MODEL_FREE_FORMÂ Â so if you’re wondering what those two means well the first extra language model informs the recognizer which speech model to prefer and the language model free form use a language model based on free form speech recognition this is a value to use for the extra language model.
 Then I use intent to putExtra extra prompt so RecognizerIntent.EXTRA_PROMPT and the second parameter will be simple string so this extra prompt is just an optional text prompt to show to the user when actually asking them to speak so you can basically type here anything you want.
Then I use startActivityForResult and pass speech intent as a first parameter and as a second parameter, I pass integer global variable RecognizerResult.
For creating onActivityResult method press ctrl + o and type onActivityResult to find the method so we need to override this method and press the ok button
1 2 3 4 5 6 7 8 9 10 11 |
@Override    protected void onActivityResult(int requestCode, int resultCode, @Nullable Intent data) {        if(requestCode==RecognizerResult && resultCode== RESULT_OK)        {            ArrayList<String> match = data.getStringArrayListExtra(RecognizerIntent.EXTRA_RESULTS);            txtField.setText(match.get(0).toString());        }        super.onActivityResult(requestCode, resultCode, data);    } |
In onActivityResult I check the requestCode and the resultCode so in the if block I am going to say if requestCode is the same as our code which we passed in our global integer variable and resultCode is result_ok then store those words or text into our array list so just import array list.
Then I use edit text to set the text from the android speech to text API from our ArrayList and convert it to tostring.