dlpy.audio.AudioTable.create_audio_table
classmethod AudioTable.create_audio_table(conn, data_path, metadata_path, features_parameters={'feature_scaling_method': 'STANDARDIZATION', 'frame_length': 25, 'frame_shift': 10, 'n_bins': 40, 'n_ceps': 40, 'n_output_frames': 500}, casout=None, task='speech2text')
Creates an Audio table and takes care of all the necessary steps.
- Parameters
- conn : CAS
A connection object to the current session.
- data_path : string
Path to the file that contains the list of audio files. This file is expected to be on the server side.
- metadata_path : string
Path to the metadata file. This file is expected to be on the client side.
- features_parameters : dict, optional
Parameters to use when extracting audio features. The defaults are shown in the signature.
- casout : string, dict, or CASTable, optional
The resulting output CAS table.
- task : string, optional
Specifies the type of task. Currently only 'speech2text' (the default) is supported.
- Returns
AudioTable
A table containing the audio features of the audio files together with their labels. The resulting table can be used directly in deep learning models.
- Raises
- DLPyError
Raised if any step in creating the AudioTable fails. An appropriate message is printed.
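Examples
The following is a minimal sketch of one way to call this method, not an official example. The helper names `merge_features` and `build_audio_table`, the host name, the port, and both file paths are hypothetical. This page does not say whether the method fills in keys missing from a partial `features_parameters` dictionary, so the sketch always passes a complete one built from the defaults in the signature.

```python
# Default feature-extraction settings, copied from the signature above.
DEFAULT_FEATURES = {
    'feature_scaling_method': 'STANDARDIZATION',
    'frame_length': 25,
    'frame_shift': 10,
    'n_bins': 40,
    'n_ceps': 40,
    'n_output_frames': 500,
}


def merge_features(overrides=None):
    """Return a complete features dict: the defaults updated with overrides."""
    merged = dict(DEFAULT_FEATURES)   # copy so the defaults stay untouched
    merged.update(overrides or {})
    return merged


def build_audio_table(conn, data_path, metadata_path, casout=None, **overrides):
    """Hypothetical wrapper around AudioTable.create_audio_table."""
    # Imported lazily so this module loads even where DLPy is not installed.
    from dlpy.audio import AudioTable

    return AudioTable.create_audio_table(
        conn,
        data_path,        # server-side file listing the audio files
        metadata_path,    # client-side metadata file
        features_parameters=merge_features(overrides),
        casout=casout,
        task='speech2text',   # the only task currently supported
    )


# Usage (requires a running CAS server; all values below are placeholders):
# import swat
# conn = swat.CAS('cas-host.example.com', 5570)
# audio = build_audio_table(conn, '/server/path/listing.txt',
#                           '/client/path/metadata.txt', n_bins=80)
```

Building the dictionary in a separate function keeps the parameter handling testable without a CAS connection.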