Because of the licenses of the respective raw datasets, we recommend that users download the raw data from the official websites and then organize it following the guide below. The detailed steps are as follows.
-
Download ScanNet v2 data HERE. Link or move the downloaded folder into this directory.
-
Download 3RScan data HERE. Link or move the downloaded folder into this directory.
-
Download Matterport3D data HERE. Link or move the downloaded folder into this directory.
-
Organize the file structure. We recommend creating soft links to the raw data folders under mmscan_data/embodiedscan_split/data.

mmscan_data/embodiedscan_split/data/
├── scannet/
│   ├── scans
│   │   ├── <scene_id>
│   │   ├── ...
├── 3rscan/
│   ├── <scene_id>
│   ├── ...
├── matterport3d/
│   ├── <scene_id>
│   ├── ...

Additionally, create a process_pcd folder under mmscan_data/embodiedscan_split to store the results. We again recommend using a symbolic link, as the total size of the processed files is fairly large (approximately 21GB).

PS: If you have already followed the EmbodiedScan tutorial to organize the data, you can skip these steps and link or copy the data folder to mmscan_data/embodiedscan_split.

After all the raw data is organized, the directory structure should be as follows:

mmscan_data
├── embodiedscan_split/
│   ├── data/
│   ├── process_pcd/
│   ├── embodiedscan-v1/
│   ├── embodiedscan-v2/
├── MMScan-beta-release
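
The linking described in this step can be sketched in Python as follows. This is only a minimal illustration, not part of the MMScan tooling: the helper name `link_raw_datasets` and its arguments are hypothetical, and it assumes a platform where symbolic links are available.

```python
from pathlib import Path


def link_raw_datasets(raw_roots, mmscan_root="mmscan_data"):
    """Symlink raw dataset folders into <mmscan_root>/embodiedscan_split/data.

    raw_roots maps a target name (e.g. "scannet") to the folder that was
    downloaded from the official website. Hypothetical helper, for illustration.
    """
    split_dir = Path(mmscan_root) / "embodiedscan_split"
    data_dir = split_dir / "data"
    data_dir.mkdir(parents=True, exist_ok=True)
    # process_pcd stores the processed results; swap this mkdir for a
    # symlink if the folder should live on a different disk.
    (split_dir / "process_pcd").mkdir(exist_ok=True)

    created = []
    for name, source in raw_roots.items():
        source = Path(source).resolve()
        if not source.is_dir():
            raise FileNotFoundError(f"raw data folder not found: {source}")
        link = data_dir / name
        if link.is_symlink() or link.exists():
            continue  # leave anything that is already in place untouched
        link.symlink_to(source, target_is_directory=True)
        created.append(link)
    return created
```

A call such as `link_raw_datasets({"scannet": "/path/to/scannet", "3rscan": "/path/to/3rscan", "matterport3d": "/path/to/matterport3d"})` would then create the three links; the `/path/to/...` locations are placeholders for wherever the raw data was downloaded.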
-
Read the raw files and generate the processed point cloud files by running the following script.
python process_all_scan.py --nproc 8
# If your various file directories do not match the configuration settings, define them using --
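
Before launching the processing script, it may help to confirm that the folders described in this guide are in place. The check below is a rough sketch under that assumption: the name `missing_entries` and the `EXPECTED` list are illustrative, and the list covers only the raw data folders and process_pcd named in the directory trees, not everything the script may read.

```python
from pathlib import Path

# Sub-paths under embodiedscan_split, taken from the directory trees in this guide.
EXPECTED = (
    "data/scannet/scans",
    "data/3rscan",
    "data/matterport3d",
    "process_pcd",
)


def missing_entries(mmscan_root="mmscan_data"):
    """Return the expected folders that are absent under embodiedscan_split."""
    split_dir = Path(mmscan_root) / "embodiedscan_split"
    # Path.is_dir() follows symbolic links, so linked folders count as present.
    return [rel for rel in EXPECTED if not (split_dir / rel).is_dir()]


if __name__ == "__main__":
    missing = missing_entries()
    if missing:
        print("Missing folders:", ", ".join(missing))
    else:
        print("Layout looks complete.")
```

An empty result only means the top-level folders exist; it does not validate the contents of individual scenes.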