SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction
Paper
•
2507.15852
•
Published
•
38
Single-file model formats for the SeC (Segment Concept) video object segmentation model, optimized for use with ComfyUI SeC Nodes.
| Format | Size | Description | GPU Requirements |
|---|---|---|---|
| SeC-4B-fp16.safetensors | 7.35 GB | Recommended - Best balance of quality and size | All CUDA GPUs |
| SeC-4B-fp8.safetensors | 3.97 GB | VRAM-constrained systems (saves 1.5-2GB VRAM) | RTX 30 series or newer |
| SeC-4B-bf16.safetensors | 7.35 GB | Alternative to FP16 | All CUDA GPUs |
| SeC-4B-fp32.safetensors | 14.14 GB | Full precision | All CUDA GPUs |
SeC (Segment Concept) uses Large Vision-Language Models for video object segmentation, achieving +11.8 points improvement over SAM 2.1 on complex semantic scenarios (SeCVOS benchmark).
Key features:
These models are designed for use with the ComfyUI SeC Nodes custom nodes.
Installation:
ComfyUI/models/sams/These are converted single-file versions of the original model:
Original Model: Developed by OpenIXCLab
Single-File Conversions: Created for ComfyUI SeC Nodes
Apache 2.0 (same as original SeC-4B model)
If you use this model in your research, please cite the original SeC paper:
@article{zhang2025sec,
title = {SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction},
author = {Zhixiong Zhang and Shuangrui Ding and Xiaoyi Dong and Songxin He and
Jianfan Lin and Junsong Tang and Yuhang Zang and Yuhang Cao and
Dahua Lin and Jiaqi Wang},
journal = {arXiv preprint arXiv:2507.15852},
year = {2025}
}