Add validation analysis script for classification results

- Implemented a new script `val_test.py` to analyze classification results from a JSONL file.
- Extracted true labels and predicted responses, handling invalid entries gracefully.
- Generated a classification report with accuracy metrics and detailed statistics for each category.
- Added functionality to export results to CSV and save analysis reports.
- Included visualization of confusion matrix and category accuracy distribution.
- Ensured dynamic handling of categories based on the input data.
This commit is contained in:
2025-07-20 21:04:08 +08:00
parent 24ac0ed40c
commit 87f2756fdf
6 changed files with 30057 additions and 5 deletions

File diff suppressed because it is too large Load Diff