[Objective-C] Use Google speech on iPhone

Do you want use google speech api to recognize text from a dictate?

If you want to add in your “test” project, (test project because it’s not a public API), you need to read Chrome Browser source code.

Here chromium url: src.chromium.org/viewvc/chrome/trunk/src/content/browser/speech

Chrome records audio chunks, accepts only a FLAC file, an open source audio codec (free lossless audio codec) file with a sample rate of 16000.0!

After upload to server:


and server responds with a json like this:

    "status": 0,
    "id": "f3847b5dcu4d657f6667f3pk4sc0a8ca-2",
    "hypotheses": [
        "utterance": "it works",
        "confidence": 0.8012238
        "utterance": "it works"

Now, using ASIFormDataRequest we can send in POST a file to google url waiting for JSON.
Objective-c code:

- (void) SpeechFromGooglezzz {
  NSURL *url = [NSURL URLWithString:@"https://www.google.com/speech-api/v1/recognize?xjerr=1&client=chromium&lang=en-US"];

  ASIFormDataRequest *request = [ASIFormDataRequest requestWithURL:url];
  NSString *filePath = [NSString stringWithFormat:@"%@%@", [[NSBundle mainBundle] resourcePath], @"tmpAudio.flac"];

  NSData *myData = [NSData dataWithContentsOfFile:filePath];
  [request addPostValue:myData forKey:@"Content"];
  [request addPostValue:@"audio/x-flac; rate=16000" forKey:@"Content-Type"];
  [request startSynchronous];

  NSLog(@"req: %@", [request responseString]);

But now is there a big problem… objective-c and iphone don’t recognize FLAC files, you need an intermediate passage to send the correct audio file.

Set up your server with FFMPEG (an audio/video converting/editing tool) and prepare a PHP/JSP/etc. that accept in post an audio file, call a PHP EXEC and launch ffmpeg with flac codec.
Then upload from server to google and return response!

Here code to record and listen audio file from iPhone. Create two buttons and copy/paste code:

- (IBAction)StartRec:(id)sender {
  NSDictionary *settings = [NSDictionary dictionaryWithObjectsAndKeys:
  [NSNumber numberWithFloat: 16000.0],                 AVSampleRateKey,
  [NSNumber numberWithInt: kAudioFormatMPEGLayer3],    AVFormatIDKey,
  [NSNumber numberWithInt: 1],                         AVNumberOfChannelsKey,
  [NSNumber numberWithInt: AVAudioQualityMax],         AVEncoderAudioQualityKey,

  NSError *error;
  NSURL *url = [NSURL fileURLWithPath:[NSString stringWithFormat:@"%@%@", [[NSBundle mainBundle] resourcePath], @"tmpAudio.mp3"]];
  recorder = [[AVAudioRecorder alloc] initWithURL:url settings:settings error:&error];

  if (recorder) {
    [recorder prepareToRecord];
    [recorder record];
  else NSLog(@"%@", [error description]);

- (IBAction)PlayAudio:(id)sender {
  NSString *path = [NSString stringWithFormat:@"%@%@", [[NSBundle mainBundle] resourcePath], @"tmpAudio.mp3"];

  SystemSoundID soundID;
  NSURL *filePath = [NSURL fileURLWithPath:path isDirectory:NO];
  AudioServicesCreateSystemSoundID((CFURLRef)filePath, &soundID);

- (IBAction)StopRec:(id)sender {
  if (recorder) [recorder stop];

That’s all, enjoy!

…and remember that speech function of google it’s a private API. You can’t use in a commercial app!