`parse_visual_obs_mode_to_struct` doesn't support obs_mode="sensor_data" #605

hesic73 · 2024-10-05T07:28:00Z

Line 114 in 362d43d

    
           SUPPORTED_OBS_MODES = ("state", "state_dict", "none", "sensor_data", "rgb", "depth", "segmentation", "rgbd", "rgb+depth", "rgb+depth+segmentation", "rgb+segmentation", "depth+segmentation", "pointcloud")

ManiSkill/mani_skill/envs/utils/observations/__init__.py

Lines 14 to 53 in 362d43d

    
           def parse_visual_obs_mode_to_struct(obs_mode: str) -> CameraObsTextures: 
        
               """Given user supplied observation mode, return a struct with the relevant textures that are to be captured""" 
        
               if obs_mode == "rgb": 
        
                   return CameraObsTextures( 
        
                       rgb=True, depth=False, segmentation=False, position=False 
        
                   ) 
        
               elif obs_mode == "rgbd": 
        
                   return CameraObsTextures( 
        
                       rgb=True, depth=True, segmentation=False, position=False 
        
                   ) 
        
               elif obs_mode == "depth": 
        
                   return CameraObsTextures( 
        
                       rgb=False, depth=True, segmentation=False, position=False 
        
                   ) 
        
               elif obs_mode == "segmentation": 
        
                   return CameraObsTextures( 
        
                       rgb=False, depth=False, segmentation=True, position=False 
        
                   ) 
        
               elif obs_mode == "rgb+depth": 
        
                   return CameraObsTextures( 
        
                       rgb=True, depth=True, segmentation=False, position=False 
        
                   ) 
        
               elif obs_mode == "rgb+depth+segmentation": 
        
                   return CameraObsTextures( 
        
                       rgb=True, depth=True, segmentation=True, position=False 
        
                   ) 
        
               elif obs_mode == "rgb+segmentation": 
        
                   return CameraObsTextures( 
        
                       rgb=True, depth=False, segmentation=True, position=False 
        
                   ) 
        
               elif obs_mode == "depth+segmentation": 
        
                   return CameraObsTextures( 
        
                       rgb=False, depth=True, segmentation=True, position=False 
        
                   ) 
        
               elif obs_mode == "pointcloud": 
        
                   return CameraObsTextures( 
        
                       rgb=True, depth=False, segmentation=True, position=True 
        
                   ) 
        
               else: 
        
                   return None

The 3D Diffusion Policy seems to require not only point cloud data and RGB images but also depth information, which obs_mode="pointcloud" doesn't support. Therefore, I expect the return value for obs_mode="sensor_data" should be:

CameraObsTextures(
            rgb=True, depth=True, segmentation=True, position=True
        )

By the way, in the point cloud representation xyzw, what does the w stand for?

The text was updated successfully, but these errors were encountered:

StoneT2000 · 2024-10-05T16:29:47Z

w stand for infinite points, it's a mask. 0 is infinity points and shouldn't be used if it's a pointcloud / depth datapoint

Your fix is correct, feel free to make a PR to fix (currently away so can't get to it now)

hesic73 mentioned this issue Oct 5, 2024

add missing 'sensor_data' handling in parse_visual_obs_mode_to_struct #606

Merged

StoneT2000 closed this as completed in #606 Oct 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`parse_visual_obs_mode_to_struct` doesn't support obs_mode="sensor_data" #605

`parse_visual_obs_mode_to_struct` doesn't support obs_mode="sensor_data" #605

hesic73 commented Oct 5, 2024 •

edited

Loading

StoneT2000 commented Oct 5, 2024 •

edited

Loading

parse_visual_obs_mode_to_struct doesn't support obs_mode="sensor_data" #605

parse_visual_obs_mode_to_struct doesn't support obs_mode="sensor_data" #605

Comments

hesic73 commented Oct 5, 2024 • edited Loading

StoneT2000 commented Oct 5, 2024 • edited Loading

`parse_visual_obs_mode_to_struct` doesn't support obs_mode="sensor_data" #605

`parse_visual_obs_mode_to_struct` doesn't support obs_mode="sensor_data" #605

hesic73 commented Oct 5, 2024 •

edited

Loading

StoneT2000 commented Oct 5, 2024 •

edited

Loading