Lang3D-XL: Language Embedded 3D Gaussians for Large-scale Scenes

Reading time: 2 minute
...

📝 Original Info

  • Title: Lang3D-XL: Language Embedded 3D Gaussians for Large-scale Scenes
  • ArXiv ID: 2512.07807
  • Date: 2025-12-08
  • Authors: Shai Krakovsky, Gal Fiebelman, Sagie Benaim, Hadar Averbuch-Elor

📝 Abstract

📄 Full Content

Fig. 1. Language Embedded 3D Gaussians for Large-scale Scenes. Given Internet collections of images depicting large-scale scenes (left), we augment a 3D Gaussian representation with a learnable semantic bottleneck (center). Our approach enables interactive text-based virtual exploration (right), enabling users to zoom-in and view semantic regions of interests, such as the spires and lintels * in the Milano Catedral depicted above. * A spire refers to a tall, slender, pointed structure on top of a roof of a building or tower, and a lintel is a horizontal structural element that spans openings such as doors and windows.

Embedding a language field in a 3D representation enables richer semantic understanding of spatial environments by linking geometry with descriptive meaning. This allows for a more intuitive human-computer interaction, enabling querying or editing scenes using natural language, and could potentially improve tasks like scene retrieval, navigation, and multimodal reasoning. While such capabilities could be transformative, in particular for large-scale scenes, we find that recent feature distillation approaches cannot effectively learn over massive Internet data due to challenges in semantic feature misalignment and inefficiency in memory and runtime. To this end, we propose a novel approach to address these challenges. First, we intr

…(Content truncated for length.)

📸 Image Gallery

00017_Calligraphy_Panels.jpg 00017_Reliefs.jpg 00017_domes.jpg 00017_minarets.jpg 00017_pca.jpg 00017_towers.jpg 00017_windows.jpg 0001_vis_vis_halonerf.jpg 0001_windows.jpg 0002_s_117.png 0002_s_139.png 0002_s_148.png 0002_s_163.png 0002_s_174.png 0002_s_179.png 0002_s_180.png 0002_s_2.png 0002_s_4.png 0002_sam_snalysis.jpg 00041_pca.jpg 00041_spires.jpg 00041_windows.jpg 00047_Calligraphy_Panels.jpg 00047_domes.jpg 00047_minarets.jpg 00047_pca.jpg 00054_Calligraphy_Panels.jpg 00054_domes.jpg 00054_minarets.jpg 00054_pca.jpg 00056_pca.jpg 00056_spires.jpg 00056_windows.jpg 00075_Reliefs.jpg 00075_pca.jpg 00075_towers.jpg 00075_windows.jpg 00083_Reliefs.jpg 00083_pca.jpg 00083_towers.jpg 00083_windows.jpg 00100_pca.jpg 00100_spires.jpg 00100_windows.jpg 00119_pca.jpg 00119_spires.jpg 00119_windows.jpg 00134_pca.jpg 00134_spires.jpg 00134_windows.jpg 00150_pca.jpg 00150_spires.jpg 00150_windows.jpg 00185_Calligraphy_Panels.jpg 00185_domes.jpg 00185_minarets.jpg 00185_pca.jpg 00237_Reliefs.jpg 00237_pca.jpg 00237_towers.jpg 00237_windows.jpg 0023_minarets.jpg 0023_vis_halonerf.jpg 0024_blue_domes.jpg 00262_Calligraphy_Panels.jpg 00262_Reliefs.jpg 00262_domes.jpg 00262_minarets.jpg 00262_pca.jpg 00262_towers.jpg 00262_windows.jpg 00274_Calligraphy_Panels.jpg 00274_Reliefs.jpg 00274_domes.jpg 00274_minarets.jpg 00274_pca.jpg 00274_towers.jpg 00274_windows.jpg 0034_domes.jpg 0034_vis_halonerf.jpg 0038_portals.jpg 0038_vis_halonerf.jpg 0062_spires.jpg 0062_vis_halonerf.jpg 0074_vis_fixed_scale.jpg 0074_vis_full.jpg 0074_vis_gt.jpg 0074_vis_no_attention.jpg 0074_vis_no_attention_interp.jpg 0074_vis_no_dino.jpg 0074_vis_no_seg.jpg 0074_vis_ours.jpg 0074_vis_speedup.jpg 0074_vis_xyz.jpg 0105_domes.jpg 0105_vis_landsam.jpg 0142_fmgs_milano_windows.jpg 0142_vis_feature3dgs_resized_windows.jpg 0142_vis_langsplat.jpg 0142_windows.jpg 0209_feature3dgs_resized_domes.jpg 0209_fmgs_blue_domes.jpg 0209_langsplat_blue_domes.jpg 0209_ours_blue_domes.jpg 0229_domes.jpg 0229_vis_landsam.jpg 0229_windows.jpg 0235_vis_fixed_scale.jpg 0235_vis_full.jpg 0235_vis_gt.jpg 0235_vis_no_attention.jpg 0235_vis_no_attention_interp.jpg 0235_vis_no_dino.jpg 0235_vis_no_seg.jpg 0236_stpaul_towers.jpg 0270_vis_fixed_scale.jpg 0270_vis_full.jpg 0270_vis_gt.jpg 0270_vis_no_attention.jpg 0270_vis_no_attention_interp.jpg 0270_vis_no_dino.jpg 0270_vis_no_seg.jpg 0394_domes.jpg 0394_vis_landsam.jpg 0399_blue_archs.jpg 0415_milano_windows.jpg 0444_domes.jpg 0444_vis_landsam.jpg 0484_vis_halonerf.jpg 0484_windows.jpg 0497_stpaul_statues.jpg 0543_fmgs_stpaul_towers.jpg 0543_towers.jpg 0543_vis_feature3dgs_resized.jpg 0543_vis_langsplat.jpg 0555_domes.jpg 0555_vis_landsam.jpg 0559_domes.jpg 0559_vis_landsam.jpg 0577_blue_caligraphy.jpg 0606_stpaul_pinnacles.jpg 0678_vis_ours.jpg 0678_vis_speedup.jpg 0678_vis_xyz.jpg 0785_vis_ours.jpg 0785_vis_speedup.jpg 0785_vis_xyz.jpg 0800_milano_lintel.jpg 0800_milano_rose_win.jpg 2280_windows.jpg atest_a_rect_0093.jpg atest_a_rect_0103.jpg atest_a_rect_0368.jpg atest_a_rect_0383.jpg atest_a_rect_0448.jpg chopsticks_2.jpg egg_2.jpg frame_00100.png frame_00100_vis_feature3dgs.jpg frame_00100_vis_ours.jpg frame_00134.png frame_00134_vis_feature3dgs.jpg frame_00134_vis_ours.jpg frame_00246.png frame_00246_vis_feature3dgs.jpg frame_00246_vis_ours.jpg frame_00286.png frame_00286_vis_feature3dgs.jpg frame_00286_vis_ours.jpg frame_00297.png frame_00297_vis_feature3dgs.jpg frame_00297_vis_ours.jpg frame_00300.png frame_00300_vis_feature3dgs.jpg frame_00300_vis_ours.jpg frame_00310.png frame_00310_vis_feature3dgs.jpg frame_00310_vis_ours.jpg green_apple_1.jpg gt.png haloGS-overview_v5b.png nori_2.jpg orig.png pca_005.png pca_010.png pca_023.png pca_034.png pca_05.png pca_ava.png score_dome_005.png score_dome_010.png score_dome_023.png score_dome_034.png score_dome_05.png score_dome_ava.png score_sky_005.png score_sky_010.png score_sky_023.png score_sky_034.png score_sky_05.png score_sky_ava.png score_tower_005.png score_tower_010.png score_tower_023.png score_tower_034.png score_tower_05.png score_tower_ava.png spill_0.jpg stuffed_bear_0.jpg tea_in_a_glass_0.jpg teaser_5_brighter.png twizzlers_1.jpg waldo_1.jpg

Reference

This content is AI-processed based on open access ArXiv data.

Start searching

Enter keywords to search articles

↑↓
ESC
⌘K Shortcut