Article intro - The LEMON dataset and surgical FM
LEMON: A Large Endoscopic MONocular Dataset and Foundation Model for Perception in Surgical Settings Chengan Che, Chao Wang, Tom Vercauteren, Sophia Tsoka, Luis C. Garcia-Peraza-Herrera presented by the VISURG lab at King's College London, accepted to #CVPR2026: "While existing open-access datasets have laid an amazing groundwork for major breakthroughs, the shift toward highly generalizable foundation models requires an entirely new scale of massive and diverse data. ๐ง๐ต๐ฒ ๐๐๐ ๐ข๐ก ๐๐ฎ๐๐ฎ๐๐ฒ๐: We compiled an extensive collection of over 4,000 high-resolution surgical videos spanning 35 distinct procedure types (both robotic and traditional). That is ๐ต๐ฏ๐ด ๐ต๐ผ๐๐ฟ๐ (๐ด๐ฑ ๐บ๐ถ๐น๐น๐ถ๐ผ๐ป ๐ณ๐ฟ๐ฎ๐บ๐ฒ๐) of high-quality footage—a massive leap in size and scope compared to existing alternatives. ๐๐ฒ๐บ๐ผ๐ป๐๐ ๐๐ผ๐๐ป๐ฑ๐ฎ๐๐ถ๐ผ๐ป ๐ ๐ผ๐ฑ๐ฒ๐น: To prove the effectiveness of this diverse data, we built LemonFM. It is pretrained on the LEMON dataset using a novel self-...







