VAD Models
aana.core.models.vad
VadParams
Bases: BaseModel
A model for the parameters of the Voice Activity Detection (VAD) model.
ATTRIBUTE | DESCRIPTION |
---|---|
chunk_size | The maximum length of each VAD output chunk. |
merge_onset | Onset to be used for the merging operation. |
merge_offset | Optional offset to be used for the merging operation. |
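As a rough illustration of how these parameters fit together, the sketch below mirrors the three attributes in a stdlib dataclass. It is a stand-in, not the library's class: the real `VadParams` is a Pydantic `BaseModel`. The name `VadParamsSketch`, the `float` types, the assumption that `chunk_size` is measured in seconds, and the sample values are all hypothetical, since the type information did not survive on this page.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class VadParamsSketch:
    """Stdlib stand-in for VadParams; field types are assumptions."""

    # Maximum length of each VAD output chunk (unit assumed to be seconds).
    chunk_size: float
    # Onset used for the merging operation.
    merge_onset: float
    # Optional offset used for the merging operation; None when not set.
    merge_offset: Optional[float] = None


# Illustrative values only; these are not the library's defaults.
params = VadParamsSketch(chunk_size=30.0, merge_onset=0.5)
print(params.chunk_size, params.merge_onset, params.merge_offset)
```

Leaving `merge_offset` unset keeps it as `None`, which matches the attribute being described as optional.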
VadSegment
Bases: BaseModel
Pydantic schema for a segment produced by the Voice Activity Detection model.
ATTRIBUTE | DESCRIPTION |
---|---|
time_interval | The start and end time of the segment. |
segments | Smaller voiced segments within a merged VAD segment. |
to_whisper_dict
Generate a dictionary with start, end, and segments keys from the VadSegment, for use with faster-whisper.
RETURNS | DESCRIPTION |
---|---|
dict | Dictionary with start, end, and segments keys. |
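The conversion described above can be sketched as follows. This is a minimal stdlib approximation under stated assumptions, not the library's implementation: `TimeIntervalSketch` and `VadSegmentSketch` are hypothetical stand-ins, the interval is assumed to expose `start` and `end` floats, and the inner segments are assumed to be `(start, end)` tuples.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class TimeIntervalSketch:
    """Hypothetical stand-in for the segment's time interval."""

    start: float
    end: float


@dataclass
class VadSegmentSketch:
    """Stdlib stand-in for VadSegment; field types are assumptions."""

    # Start and end time of the merged segment.
    time_interval: TimeIntervalSketch
    # Smaller voiced segments inside the merged segment, assumed (start, end) pairs.
    segments: List[Tuple[float, float]] = field(default_factory=list)

    def to_whisper_dict(self) -> dict:
        # Flatten the interval into top-level start/end keys and pass the
        # inner segments through unchanged.
        return {
            "start": self.time_interval.start,
            "end": self.time_interval.end,
            "segments": self.segments,
        }


segment = VadSegmentSketch(
    time_interval=TimeIntervalSketch(start=0.0, end=5.0),
    segments=[(0.0, 2.0), (3.0, 5.0)],
)
result = segment.to_whisper_dict()
print(result)
# {'start': 0.0, 'end': 5.0, 'segments': [(0.0, 2.0), (3.0, 5.0)]}
```

The returned dictionary carries exactly the three keys named in the description, so a caller can read `result["start"]`, `result["end"]`, and `result["segments"]` directly.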