Training: TDNN trained for 7 epochs on 91100 utterances (70 archives, 163 iterations).
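The epoch, archive and iteration counts above can be tied together with a short sketch. It assumes a Kaldi-style schedule in which the iteration count is the number of archive passes, times two, divided by the sum of the initial and final parallel job counts. The toolkit is not named in these notes, and the job counts (2 and 4) are hypothetical values chosen only because they reproduce the reported 163 iterations.

```python
def num_iterations(num_epochs: int, num_archives: int,
                   jobs_initial: int, jobs_final: int) -> int:
    """Iteration count under an assumed Kaldi-style schedule.

    Each iteration consumes one archive per parallel job, and the job
    count is assumed to ramp linearly from jobs_initial to jobs_final,
    so the average number of jobs is (jobs_initial + jobs_final) / 2.
    """
    archives_to_process = num_epochs * num_archives  # total archive passes
    return (archives_to_process * 2) // (jobs_initial + jobs_final)


# 7 epochs and 70 archives come from the notes; the job counts are assumptions.
iterations = num_iterations(num_epochs=7, num_archives=70,
                            jobs_initial=2, jobs_final=4)
print(iterations)
```

Any pair of job counts summing to 6 (for example 3 and 3) gives the same result, so the sketch does not identify the actual configuration.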
Evaluating:
- 10s: on 20284 segments (20282 for fusion cases involving pitch as a single feature)
Testing:
- 10s: on 18663 segments (18649 for fusion cases involving pitch as a single feature, 0.075% fewer)
- 3s: on 46202 segments (45801 for fusion cases involving pitch as a single feature, 0.868% fewer)
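The reduction in segment counts for the pitch-only fusion cases can be recomputed from the counts listed above. This is a minimal sketch: the function name and dictionary keys are illustrative, and the percentage is taken relative to the full segment count, which is an assumption about how the figures were derived.

```python
def relative_drop(full: int, reduced: int) -> float:
    """Percentage of segments lost, relative to the full segment count."""
    return 100.0 * (full - reduced) / full


# (full count, count for fusion cases with pitch as a single feature)
counts = {
    "eval 10s": (20284, 20282),
    "test 10s": (18663, 18649),
    "test 3s": (46202, 45801),
}

for name, (full, reduced) in counts.items():
    lost = full - reduced
    print(f"{name}: {lost} fewer segments ({relative_drop(full, reduced):.3f}%)")
```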
Features  Accuracy  C_primary  Training runtime
23D       0.941     0.078
23D       0.9308    0.0919
23D       0.9387    0.0802
23D       0.9378    0.0822
72D       0.945     0.070
77D       0.943     0.071
76D       0.945     0.072
76D       0.945     0.069
69D       0.944     0.074
69D       0.9454    0.0697
69D       0.9397    0.0797
69D       0.9451    0.0722
5D        0.860     0.177
4D        0.695     0.385
1D        0.720     0.359