Cataract-1K: Cataract Surgery Dataset for Scene Segmentation, Phase Recognition, and Irregularity Detection Paper • 2312.06295 • Published Dec 11, 2023
A Quantitative Evaluation of Dense 3D Reconstruction of Sinus Anatomy from Monocular Endoscopic Video Paper • 2310.14364 • Published Oct 22, 2023
SegVol: Universal and Interactive Volumetric Medical Image Segmentation Paper • 2311.13385 • Published Nov 22, 2023
CADS: A Comprehensive Anatomical Dataset and Segmentation for Whole-Body Anatomy in Computed Tomography Paper • 2507.22953 • Published Jul 29, 2025
SAM-Med3D: Towards General-purpose Segmentation Models for Volumetric Medical Images Paper • 2310.15161 • Published Oct 23, 2023
MedSyn: Text-guided Anatomy-aware Synthesis of High-Fidelity 3D CT Images Paper • 2310.03559 • Published Oct 5, 2023
Text Promptable Surgical Instrument Segmentation with Vision-Language Models Paper • 2306.09244 • Published Jun 15, 2023
Text-driven Adaptation of Foundation Models for Few-shot Surgical Workflow Analysis Paper • 2501.09555 • Published Jan 16, 2025
seg2med: a segmentation-based medical image generation framework using denoising diffusion probabilistic models Paper • 2504.09182 • Published Apr 12, 2025
DDGS-CT: Direction-Disentangled Gaussian Splatting for Realistic Volume Rendering Paper • 2406.02518 • Published Jun 4, 2024
CT2Rep: Automated Radiology Report Generation for 3D Medical Imaging Paper • 2403.06801 • Published Mar 11, 2024
CT-Agent: A Multimodal-LLM Agent for 3D CT Radiology Question Answering Paper • 2505.16229 • Published May 22, 2025
3D MedDiffusion: A 3D Medical Diffusion Model for Controllable and High-quality Medical Image Generation Paper • 2412.13059 • Published Dec 17, 2024
M3Ret: Unleashing Zero-shot Multimodal Medical Image Retrieval via Self-Supervision Paper • 2509.01360 • Published Sep 1, 2025
BiMediX2: Bio-Medical EXpert LMM for Diverse Medical Modalities Paper • 2412.07769 • Published Dec 10, 2024
POV-Surgery: A Dataset for Egocentric Hand and Tool Pose Estimation During Surgical Activities Paper • 2307.10387 • Published Jul 19, 2023
Text-to-CT Generation via 3D Latent Diffusion Model with Contrastive Vision-Language Pretraining Paper • 2506.00633 • Published May 31, 2025
Multi-Branch Generative Models for Multichannel Imaging with an Application to PET/CT Joint Reconstruction Paper • 2404.08748 • Published Apr 12, 2024
TemMed-Bench: Evaluating Temporal Medical Image Reasoning in Vision-Language Models Paper • 2509.25143 • Published Sep 29, 2025