glowz / data-prepare
8 Commits, 1 Branch, 0 Tags
ecf62793008a0a0c67ca0cae78fe63eb98233364
glowz
ecf6279300
Add generation of multiple question templates and data parsing; optimize the data conversion pipeline
2025-07-26 11:16:28 +08:00
01-pre.py
swift
2025-07-18 18:00:04 +08:00
02-data_select_date_len.py
Add data processing scripts covering filtering raw data, sampling, and conversion to Alpaca format
2025-06-09 14:39:07 +08:00
03-data_select_random.py
Add data processing scripts covering filtering raw data, sampling, and conversion to Alpaca format
2025-06-09 14:39:07 +08:00
03-data_select_ratio.py
swift
2025-07-18 18:00:04 +08:00
04-data2swift.py
swift
2025-07-18 18:00:04 +08:00
05-data-csv-swift-pretrain.py
swift
2025-07-18 18:00:04 +08:00
05-data-csv-swift-sft.py
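Update data conversion to extract information from the new format and generate multiple question templates; improve input/output file paths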
2025-07-19 17:06:10 +08:00
05-data-csv-xtuner.py
Add data processing scripts covering filtering raw data, sampling, and conversion to Alpaca format
2025-06-09 14:39:07 +08:00
05-data-swfit-pretrain-revise.py
swift
2025-07-18 18:00:04 +08:00
05-data-swfit-sft2multi_type.py
Add generation of multiple question templates and data parsing; optimize the data conversion pipeline
2025-07-26 11:16:28 +08:00
05-data-swfit-sft2pretrain.py
swift
2025-07-18 18:00:04 +08:00
05-data-swfit-xtuner.py
swift
2025-07-18 18:00:04 +08:00
05-data-xtuner-swfit.py
swift
2025-07-18 18:00:04 +08:00
06-data-swift-compose.py
swift
2025-07-18 18:00:04 +08:00
06-data-xtuner-compose.py
swift
2025-07-18 18:00:04 +08:00
arxiv-metadata-oai-snapshot--swift-26-500-m.jsonl
Add validation analysis script for classification results
2025-07-20 21:04:08 +08:00
arxiv-metadata-oai-snapshot--swift-26-500.json
swift
2025-07-18 18:00:04 +08:00
arxiv-metadata-oai-snapshot--swift-26-500.jsonl
Add validation analysis script for classification results
2025-07-20 21:04:08 +08:00
arxiv-metadata-oai-snapshot--swift-26-m.jsonl
Add validation analysis script for classification results
2025-07-20 21:04:08 +08:00
arxiv-metadata-oai-snapshot--swift-26.json
swift
2025-07-18 18:00:04 +08:00
arxiv-metadata-oai-snapshot--swift-26.jsonl
Add validation analysis script for classification results
2025-07-20 21:04:08 +08:00
arxiv-metadata-oai-snapshot--swift-26.jsonl.txt
swift
2025-07-18 18:00:04 +08:00
arxiv-metadata-oai-snapshot--swift-pretrain-26.jsonl
swift
2025-07-18 18:00:04 +08:00
crawl-arxiv.py
Add arXiv paper crawling, fetching paper titles, authors, and abstracts for a given query
2025-07-25 18:11:11 +08:00
README.md
first commit
2025-06-09 14:21:39 +08:00
val_test.py
Add validation analysis script for classification results
2025-07-20 21:04:08 +08:00
README.md
The file is empty.
Description
No description provided
Readme
19 MiB
Languages: Python 100%