Alessamo
/

CDT-Task-Tagger

capability-tagging

Model card Files Files and versions

CDT-Task-Tagger / README.md

Alessamo's picture

Update README.md

20f393d verified 3 months ago

|

history blame contribute delete

2.32 kB

	---
	license: apache-2.0
	language:
	- en
	base_model:
	- Qwen/Qwen2.5-7B
	tags:
	- capability-tagging
	- task
	- qwen
	---
	# Model Card for CDT-Task-Tagger
	This model is a component of the Cognition-Domain-Task (CDT) framework, a comprehensive capability framework for Large Language Models presented in our paper CDT: A Comprehensive Capability Framework for Large Language Models Across Cognition, Domain, and Task. It has been specifically fine-tuned to classify a given instruction into one of 16 task types.

	## Model Details

	### Model Description
	This model identifies the fundamental task a user wants the LLM to perform.

	- Model type: Qwen2ForCausalLM
	- Language(s) (NLP): English
	- License: Apache 2.0
	- Finetuned from model: Qwen2.5-7B-Base

	### Model Sources

	<!-- Provide the basic links for the model. -->

	- Repository: https://github.com/Alessa-mo/CDT
	- Paper Link: https://arxiv.org/abs/2509.24422

	### Basic Usage
	Please refer to https://github.com/Alessa-mo/CDT. You can run the following scripts to tag the cognition labels.
	```bash
	cd tag_annotate
	export CUDA_VISIBLE_DEVICES=0
	python annotate.py \
	--data_path path/to/your/data \
	--output_dir path/to/output/dir \
	--model_path CDT-Task-Tagger \
	--prompt_file ./prompt/annotation_prompt.jsonl \
	--cognition_skill_file ./prompt/cognition.json \
	--domain_skill_file ./prompt/domain.json \
	--task_skill_file ./prompt/task.json \
	--tag_type "task" \
	--batch_size 32
	```
	Note: Make sure your data is a JSON file and has the following format:
	```json
	[
	{
	"messages": [
	{
	"role": "user",
	"content": "xxxx"
	},
	{
	"role": "assistant",
	"content": "xxxx"
	}
	]
	},
	]
	```
	## Citation
	If you find this model useful, please cite:
	```bash
	@misc{mo2025cdtcomprehensivecapabilityframework,
	title={CDT: A Comprehensive Capability Framework for Large Language Models Across Cognition, Domain, and Task},
	author={Haosi Mo and Xinyu Ma and Xuebo Liu and Derek F. Wong and Yu Li and Jie Liu and Min Zhang},
	year={2025},
	eprint={2509.24422},
	archivePrefix={arXiv},
	primaryClass={cs.CL},
	url={https://arxiv.org/abs/2509.24422},
	}
	```