Class: Google::Cloud::Speech::V1::SpeechClient

Inherits:
Object
Defined in:
lib/google/cloud/speech/v1/speech_client.rb,
lib/google/cloud/speech/v1/helpers.rb

Overview

Service that implements the Google Cloud Speech API.

Constant Summary

SERVICE_ADDRESS =

The default address of the service.

"speech.googleapis.com".freeze
DEFAULT_SERVICE_PORT =

The default port of the service.

443
GRPC_INTERCEPTORS =

The default set of gRPC interceptors.

[]
DEFAULT_TIMEOUT =
30
ALL_SCOPES =

The scopes needed to make gRPC calls to all of the methods defined in this service.

[
  "https://www.googleapis.com/auth/cloud-platform"
].freeze

Constructor Details

#initialize(credentials: nil, scopes: ALL_SCOPES, client_config: {}, timeout: DEFAULT_TIMEOUT, metadata: nil, exception_transformer: nil, lib_name: nil, lib_version: "") ⇒ SpeechClient

Returns a new instance of SpeechClient

Parameters:

  • credentials (Google::Auth::Credentials, String, Hash, GRPC::Core::Channel, GRPC::Core::ChannelCredentials, Proc)

    Provides the means for authenticating requests made by the client. This parameter accepts several types. A Google::Auth::Credentials uses the properties of its keyfile to authenticate requests made by this client. A String is treated as the path to the keyfile used to construct credentials for this client. A Hash is treated as the contents of a keyfile used to construct credentials for this client. A GRPC::Core::Channel is used to make calls through. A GRPC::Core::ChannelCredentials is used to set up the RPC client; the channel credentials should already be composed with a GRPC::Core::CallCredentials object. A Proc is used as an updater_proc for the gRPC channel; the proc transforms the metadata for requests, generally to supply OAuth credentials.

  • scopes (Array<String>)

    The OAuth scopes for this service. This parameter is ignored if an updater_proc is supplied.

  • client_config (Hash)

    A Hash for call options for each method. See Google::Gax#construct_settings for the structure of this data. Falls back to the default config if not specified or the specified config is missing data points.

  • timeout (Numeric)

    The default timeout, in seconds, for calls made through this client.

  • metadata (Hash)

    Default metadata to be sent with each request. This can be overridden on a per call basis.

  • exception_transformer (Proc)

    An optional proc that intercepts any exceptions raised during an API call to inject custom error handling.
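
The way the constructor routes each credentials type can be sketched in plain Ruby. This is a minimal illustration, not the library's code: FakeChannel, FakeChannelCredentials, FakeCredentials and resolve_credentials are hypothetical stand-ins so the sketch runs without the gRPC or auth gems. The nil case (where the real constructor falls back to default credentials) is not modelled and returns an empty hash here.

```ruby
# Hypothetical placeholders for GRPC::Core::Channel,
# GRPC::Core::ChannelCredentials and Google::Auth::Credentials.
FakeChannel = Class.new
FakeChannelCredentials = Class.new
FakeCredentials = Struct.new(:keyfile) do
  # Stand-in for an updater_proc: adds auth data to request metadata.
  def updater_proc
    proc { |metadata| metadata.merge(authorization: "Bearer fake-token") }
  end
end

# Returns the keyword under which the credential would reach the stub.
def resolve_credentials(credentials)
  case credentials
  when String, Hash
    # A keyfile path or keyfile contents is wrapped in a credentials object.
    { updater_proc: FakeCredentials.new(credentials).updater_proc }
  when FakeChannel
    { channel: credentials }
  when FakeChannelCredentials
    { chan_creds: credentials }
  when Proc
    { updater_proc: credentials }
  when FakeCredentials
    { updater_proc: credentials.updater_proc }
  else
    {}
  end
end

resolve_credentials("/path/to/keyfile.json").keys # => [:updater_proc]
resolve_credentials(FakeChannel.new).keys         # => [:channel]
```

Only one of the three keywords ends up populated for any given input, which is why the scopes parameter has no effect once an updater_proc is supplied directly.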



# File 'lib/google/cloud/speech/v1/speech_client.rb', line 96

def initialize \
    credentials: nil,
    scopes: ALL_SCOPES,
    client_config: {},
    timeout: DEFAULT_TIMEOUT,
    metadata: nil,
    exception_transformer: nil,
    lib_name: nil,
    lib_version: ""
  # These require statements are intentionally placed here to initialize
  # the gRPC module only when it's required.
  # See https://github.com/googleapis/toolkit/issues/446
  require "google/gax/grpc"
  require "google/cloud/speech/v1/cloud_speech_services_pb"

  credentials ||= Google::Cloud::Speech::V1::Credentials.default

  @operations_client = OperationsClient.new(
    credentials: credentials,
    scopes: scopes,
    client_config: client_config,
    timeout: timeout,
    lib_name: lib_name,
    lib_version: lib_version,
  )

  if credentials.is_a?(String) || credentials.is_a?(Hash)
    updater_proc = Google::Cloud::Speech::V1::Credentials.new(credentials).updater_proc
  end
  if credentials.is_a?(GRPC::Core::Channel)
    channel = credentials
  end
  if credentials.is_a?(GRPC::Core::ChannelCredentials)
    chan_creds = credentials
  end
  if credentials.is_a?(Proc)
    updater_proc = credentials
  end
  if credentials.is_a?(Google::Auth::Credentials)
    updater_proc = credentials.updater_proc
  end

  package_version = Gem.loaded_specs['google-cloud-speech'].version.version

  google_api_client = "gl-ruby/#{RUBY_VERSION}"
  google_api_client << " #{lib_name}/#{lib_version}" if lib_name
  google_api_client << " gapic/#{package_version} gax/#{Google::Gax::VERSION}"
  google_api_client << " grpc/#{GRPC::VERSION}"
  google_api_client.freeze

  headers = { :"x-goog-api-client" => google_api_client }
  headers.merge!(metadata) unless metadata.nil?
  client_config_file = Pathname.new(__dir__).join(
    "speech_client_config.json"
  )
  defaults = client_config_file.open do |f|
    Google::Gax.construct_settings(
      "google.cloud.speech.v1.Speech",
      JSON.parse(f.read),
      client_config,
      Google::Gax::Grpc::STATUS_CODE_NAMES,
      timeout,
      errors: Google::Gax::Grpc::API_ERRORS,
      metadata: headers
    )
  end

  # Allow overriding the service path/port in subclasses.
  service_path = self.class::SERVICE_ADDRESS
  port = self.class::DEFAULT_SERVICE_PORT
  interceptors = self.class::GRPC_INTERCEPTORS
  @speech_stub = Google::Gax::Grpc.create_stub(
    service_path,
    port,
    chan_creds: chan_creds,
    channel: channel,
    updater_proc: updater_proc,
    scopes: scopes,
    interceptors: interceptors,
    &Google::Cloud::Speech::V1::Speech::Stub.method(:new)
  )

  @recognize = Google::Gax.create_api_call(
    @speech_stub.method(:recognize),
    defaults["recognize"],
    exception_transformer: exception_transformer
  )
  @long_running_recognize = Google::Gax.create_api_call(
    @speech_stub.method(:long_running_recognize),
    defaults["long_running_recognize"],
    exception_transformer: exception_transformer
  )
  @streaming_recognize = Google::Gax.create_api_call(
    @speech_stub.method(:streaming_recognize),
    defaults["streaming_recognize"],
    exception_transformer: exception_transformer
  )
end
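
The constructor reads SERVICE_ADDRESS and DEFAULT_SERVICE_PORT through self.class:: so that a subclass can redefine them. The sketch below shows that lookup in isolation; BaseClient and LocalEmulatorClient are hypothetical classes invented for illustration, and the localhost address and port 8080 are assumed values, not anything the library defines.

```ruby
class BaseClient
  SERVICE_ADDRESS = "speech.googleapis.com".freeze
  DEFAULT_SERVICE_PORT = 443

  attr_reader :service_path, :port

  def initialize
    # self.class:: resolves the constant on the runtime class (and then its
    # ancestors), so a subclass that redefines it changes what is read here.
    @service_path = self.class::SERVICE_ADDRESS
    @port = self.class::DEFAULT_SERVICE_PORT
  end
end

# Hypothetical subclass pointing at a local endpoint.
class LocalEmulatorClient < BaseClient
  SERVICE_ADDRESS = "localhost".freeze
  DEFAULT_SERVICE_PORT = 8080
end

BaseClient.new.service_path          # => "speech.googleapis.com"
LocalEmulatorClient.new.service_path # => "localhost"
LocalEmulatorClient.new.port         # => 8080
```

A bare SERVICE_ADDRESS reference inside initialize would be resolved lexically and always return the base class value, which is why the qualified form is used.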

Instance Method Details

#get_operation(name, options: nil) ⇒ Google::Gax::Operation

Gets the latest state of a long-running operation. Clients can use this method to poll the operation result at intervals as recommended by the API service.

Examples:

require "google/cloud/speech"

speech_client = Google::Cloud::Speech.new version: :v1

op = speech_client.get_operation "-"

# Process error operations.
log_error op.error if op.error?

if op.done?
  # Process completed operations.
  log_finished op.response, op.metadata
else
  # Process pending operations.
  log_pending op.name, op.metadata
end

Parameters:

  • name (String)

    The name of the operation resource.

  • options (Google::Gax::CallOptions)

    Overrides the default settings for this call, e.g., timeout, retries, etc.

Returns:

  • (Google::Gax::Operation)

Raises:

  • (Google::Gax::GaxError)

    if the RPC is aborted.



# File 'lib/google/cloud/speech/v1/helpers.rb', line 120

def get_operation name, options: nil
  proto_op = @operations_client.get_operation name, options: options

  Google::Gax::Operation.new(
    proto_op,
    @operations_client,
    Google::Cloud::Speech::V1::LongRunningRecognizeResponse,
    Google::Cloud::Speech::V1::LongRunningRecognizeMetadata,
    call_options: options
  )
end
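
Polling at intervals, as the description above suggests, might look like the following sketch. PollableOperation is a hypothetical stand-in that exposes only done? and reload!, and poll_until_done with its delay parameters is an assumed helper, not part of the library. The capped exponential delay is one possible choice; the API service may recommend a different interval.

```ruby
# Hypothetical stand-in for an operation handle: it reports done after a
# fixed number of reloads instead of fetching state from the API.
class PollableOperation
  attr_reader :reloads

  def initialize(reloads_until_done)
    @remaining = reloads_until_done
    @reloads = 0
  end

  def done?
    @remaining <= 0
  end

  def reload!
    @reloads += 1
    @remaining -= 1
    self
  end
end

# Polls until the operation is done, sleeping between attempts with a
# capped exponential delay. Returns true if it finished, false otherwise.
def poll_until_done(operation, max_attempts: 10, base_delay: 0.01, max_delay: 0.1)
  delay = base_delay
  max_attempts.times do
    return true if operation.done?
    sleep delay
    operation.reload!
    delay = [delay * 2, max_delay].min
  end
  operation.done?
end

op = PollableOperation.new(3)
poll_until_done(op) # => true
op.reloads          # => 3
```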

#long_running_recognize(config, audio, options: nil) ⇒ Google::Gax::Operation

Performs asynchronous speech recognition: receive results via the google.longrunning.Operations interface. Returns either an Operation.error or an Operation.response, which contains a LongRunningRecognizeResponse message.

Examples:

require "google/cloud/speech"

speech_client = Google::Cloud::Speech.new(version: :v1)
encoding = :FLAC
sample_rate_hertz = 44100
language_code = "en-US"
config = {
  encoding: encoding,
  sample_rate_hertz: sample_rate_hertz,
  language_code: language_code
}
uri = "gs://bucket_name/file_name.flac"
audio = { uri: uri }

# Register a callback during the method call.
operation = speech_client.long_running_recognize(config, audio) do |op|
  raise op.results.message if op.error?
  op_results = op.results
  # Process the results.

  metadata = op.metadata
  # Process the metadata.
end

# Or use the return value to register a callback.
operation.on_done do |op|
  raise op.results.message if op.error?
  op_results = op.results
  # Process the results.

  metadata = op.metadata
  # Process the metadata.
end

# Manually reload the operation.
operation.reload!

# Or block until the operation completes, triggering callbacks on
# completion.
operation.wait_until_done!

Parameters:

  • config (Google::Cloud::Speech::V1::RecognitionConfig | Hash)

    Required. Provides information to the recognizer that specifies how to process the request. A hash of the same form as Google::Cloud::Speech::V1::RecognitionConfig can also be provided.

  • audio (Google::Cloud::Speech::V1::RecognitionAudio | Hash)

    Required. The audio data to be recognized. A hash of the same form as Google::Cloud::Speech::V1::RecognitionAudio can also be provided.

  • options (Google::Gax::CallOptions)

    Overrides the default settings for this call, e.g., timeout, retries, etc.

Returns:

  • (Google::Gax::Operation)

Raises:

  • (Google::Gax::GaxError)

    if the RPC is aborted.



# File 'lib/google/cloud/speech/v1/speech_client.rb', line 307

def long_running_recognize \
    config,
    audio,
    options: nil
  req = {
    config: config,
    audio: audio
  }.delete_if { |_, v| v.nil? }
  req = Google::Gax::to_proto(req, Google::Cloud::Speech::V1::LongRunningRecognizeRequest)
  operation = Google::Gax::Operation.new(
    @long_running_recognize.call(req, options),
    @operations_client,
    Google::Cloud::Speech::V1::LongRunningRecognizeResponse,
    Google::Cloud::Speech::V1::LongRunningRecognizeMetadata,
    call_options: options
  )
  operation.on_done { |operation| yield(operation) } if block_given?
  operation
end
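
The last two lines of the method forward an optional block to on_done and then return the operation, which is why a callback can be registered either during the call or afterwards. The sketch below isolates that pattern; CallbackOperation, complete! and start_operation are hypothetical names used only for illustration and do not reflect how the real operation object is implemented.

```ruby
# Hypothetical stand-in that stores callbacks and runs them on completion.
class CallbackOperation
  def initialize
    @callbacks = []
    @done = false
  end

  def done?
    @done
  end

  def on_done(&block)
    @callbacks << block
  end

  # Simulates the operation finishing; runs callbacks in registration order.
  def complete!
    @done = true
    @callbacks.each { |callback| callback.call(self) }
  end
end

# Same shape as the method above: register the caller's block if one was
# given, then hand the operation back so more callbacks can be added.
def start_operation
  operation = CallbackOperation.new
  operation.on_done { |op| yield(op) } if block_given?
  operation
end

events = []
operation = start_operation { |_op| events << :registered_during_call }
operation.on_done { |_op| events << :registered_afterwards }
operation.complete!
events # => [:registered_during_call, :registered_afterwards]
```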

#recognize(config, audio, options: nil) {|result, operation| ... } ⇒ Google::Cloud::Speech::V1::RecognizeResponse

Performs synchronous speech recognition: receive results after all audio has been sent and processed.

Examples:

require "google/cloud/speech"

speech_client = Google::Cloud::Speech.new(version: :v1)
encoding = :FLAC
sample_rate_hertz = 44100
language_code = "en-US"
config = {
  encoding: encoding,
  sample_rate_hertz: sample_rate_hertz,
  language_code: language_code
}
uri = "gs://bucket_name/file_name.flac"
audio = { uri: uri }
response = speech_client.recognize(config, audio)

Parameters:

  • config (Google::Cloud::Speech::V1::RecognitionConfig | Hash)

    Required. Provides information to the recognizer that specifies how to process the request. A hash of the same form as Google::Cloud::Speech::V1::RecognitionConfig can also be provided.

  • audio (Google::Cloud::Speech::V1::RecognitionAudio | Hash)

    Required. The audio data to be recognized. A hash of the same form as Google::Cloud::Speech::V1::RecognitionAudio can also be provided.

  • options (Google::Gax::CallOptions)

    Overrides the default settings for this call, e.g., timeout, retries, etc.

Yields:

  • (result, operation)

    Access the result along with the RPC operation

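Yield Parameters:

  • result (Google::Cloud::Speech::V1::RecognizeResponse)

  • operation (GRPC::ActiveCall::Operation)

Returns:

  • (Google::Cloud::Speech::V1::RecognizeResponse)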

Raises:

  • (Google::Gax::GaxError)

    if the RPC is aborted.



# File 'lib/google/cloud/speech/v1/speech_client.rb', line 233

def recognize \
    config,
    audio,
    options: nil,
    &block
  req = {
    config: config,
    audio: audio
  }.delete_if { |_, v| v.nil? }
  req = Google::Gax::to_proto(req, Google::Cloud::Speech::V1::RecognizeRequest)
  @recognize.call(req, options, &block)
end
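
Once recognize returns, the response can be walked the same way the streaming example reads its results: each result carries a list of alternatives with a transcript and a confidence. The sketch below uses Struct stand-ins (Alternative, Result, Response) rather than the real protobuf classes, and best_transcripts is a hypothetical helper. It assumes the first alternative of each result is the most likely one.

```ruby
# Hypothetical stand-ins mirroring the shape of a recognition response.
Alternative = Struct.new(:transcript, :confidence)
Result = Struct.new(:alternatives)
Response = Struct.new(:results)

# Collects the transcript of the first alternative of every result,
# skipping results that carry no alternatives.
def best_transcripts(response)
  response.results.map do |result|
    best = result.alternatives.first
    best && best.transcript
  end.compact
end

response = Response.new([
  Result.new([Alternative.new("how old is the Brooklyn Bridge", 0.98)]),
  Result.new([])
])
best_transcripts(response) # => ["how old is the Brooklyn Bridge"]
```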

#streaming_recognize(streaming_config, options: nil) ⇒ Google::Cloud::Speech::V1::Stream

Performs bidirectional streaming speech recognition: receive results while sending audio. This method is only available via the gRPC API (not REST).

The streaming_config argument provides information to the recognizer that specifies how to process the request. A hash of the same form as Google::Cloud::Speech::V1::StreamingRecognitionConfig can also be provided.

Examples:

require "google/cloud/speech"

speech_client = Google::Cloud::Speech.new
streaming_config = {
  config: {
    encoding: :linear16,
    language_code: "en-US",
    sample_rate_hertz: 16000
  }
}
stream = speech_client.streaming_recognize(streaming_config)

# Stream 5 seconds of audio from the microphone
# Actual implementation of microphone input varies by platform
5.times do
  stream.send MicrophoneInput.read(32000)
end

stream.stop
stream.wait_until_complete!

results = stream.results
result = results.first.alternatives.first
result.transcript #=> "how old is the Brooklyn Bridge"
result.confidence #=> 0.9826789498329163

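Parameters:

  • streaming_config (Google::Cloud::Speech::V1::StreamingRecognitionConfig | Hash)

  • options (Google::Gax::CallOptions)

    Overrides the default settings for this call, e.g., timeout, retries, etc.

Returns:

  • (Google::Cloud::Speech::V1::Stream)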

Raises:

  • (Google::Gax::GaxError)

    if the RPC is aborted.



# File 'lib/google/cloud/speech/v1/helpers.rb', line 69

def streaming_recognize streaming_config, options: nil
  if streaming_config.is_a?(::Hash) &&
     streaming_config[:config] &&
     streaming_config[:config][:encoding]
    streaming_config[:config][:encoding] =
      streaming_config[:config][:encoding].upcase
  end
  V1::Stream.new(
    streaming_config,
    proc do |reqs|
      request_protos = reqs.lazy.map do |req|
        Google::Gax.to_proto(
          req,
          Google::Cloud::Speech::V1::StreamingRecognizeRequest
        )
      end
      @streaming_recognize.call(request_protos, options)
    end
  )
end
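
Two details of the streaming path can be checked in plain Ruby. First, the method upcases the :encoding value of a Hash config, which is why the example can pass :linear16. Second, the example's 32000-byte reads correspond to one second of audio each if the input is 16-bit mono PCM at 16000 Hz. Both helpers below are hypothetical; normalize_encoding returns a copy, whereas the method above modifies the Hash in place, and the mono, 2-bytes-per-sample figures are assumptions about the microphone input.

```ruby
# Upcases the :encoding entry of a Hash config, returning a new Hash.
# Anything that is not a Hash, or has no encoding, is returned unchanged.
def normalize_encoding(streaming_config)
  return streaming_config unless streaming_config.is_a?(Hash)
  config = streaming_config[:config]
  return streaming_config unless config && config[:encoding]
  streaming_config.merge(
    config: config.merge(encoding: config[:encoding].upcase)
  )
end

# Bytes per second of uncompressed 16-bit PCM audio (2 bytes per sample).
def linear16_bytes_per_second(sample_rate_hertz, channels: 1)
  sample_rate_hertz * 2 * channels
end

input = { config: { encoding: :linear16, sample_rate_hertz: 16000 } }
normalize_encoding(input)[:config][:encoding] # => :LINEAR16
linear16_bytes_per_second(16000)              # => 32000
```

Under those assumptions, five reads of 32000 bytes add up to the five seconds of audio mentioned in the example comment.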