dlpy.audio.AudioTable.create_audio_table

classmethod AudioTable.create_audio_table(conn, data_path, metadata_path, features_parameters={'feature_scaling_method': 'STANDARDIZATION', 'frame_length': 25, 'frame_shift': 10, 'n_bins': 40, 'n_ceps': 40, 'n_output_frames': 500}, casout=None, task='speech2text')

Creates an Audio table and takes care of all the necessary steps

Parameters
connCAS

A connection object to the current session.

data_pathstring

Path to the file that contains the list of audio files (this is expected to be on the server side).

metadata_pathstring

Location to the metadata file (this is expected to be on the client side).

features_parametersdict, optional

Parameters to be used while extracting audio features

casoutstring, dict, or CASTable, optional

Resulting output CAS table

taskstring, optional

Specifies the type of the task. Default is speech to text. Note: currently only support ‘speech2text’ (default)

Returns
AudioTable

A table containing audio features of audio files as well as their labels. The resulting table can be directly used in the deep learning models.

Raises
DLPyError

If anything goes wrong at any point in the process of creating this AudioTable, it complains and prints the appropriate message.