1- Self-supervised Depth and Motion Estimation in Colonoscopy [Hugging Face]

The project estimates depth and camera pose from 2D colonoscopy image frames in a semi- and self-supervised manner. The project uses a scale-invariant log-depth loss. Try it out on your own images on Hugging Face at the link provided.
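The training code is not part of this summary; as a minimal sketch, a scale-invariant log-depth loss of the kind mentioned above (in the spirit of Eigen et al., 2014) could be written as below, where the tensor shapes and the `lam` weight are illustrative assumptions:

```python
import torch

def scale_invariant_log_loss(pred_depth, gt_depth, lam=0.5, eps=1e-6):
    """Scale-invariant log-depth loss (after Eigen et al., 2014).

    pred_depth, gt_depth: positive depth maps of shape (B, H, W).
    lam: weight on the scale term; lam=1 gives full scale invariance.
    """
    # In log space a global depth scaling becomes an additive constant,
    # which the second term removes.
    d = torch.log(pred_depth + eps) - torch.log(gt_depth + eps)
    mse_term = (d ** 2).mean(dim=(1, 2))          # per-image mean squared log error
    scale_term = lam * d.mean(dim=(1, 2)) ** 2    # per-image squared mean log error
    return (mse_term - scale_term).mean()
```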

2- Visual & Tactile Synchrony Perception in Metaverse [Code]

The project aims to estimate the threshold of acceptable latency between haptic and visual signals before the asynchrony becomes noticeable to humans in the metaverse. The project uses an HTC Vive for the VR display and a 3D Sense Touch haptic device for haptic feedback.
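The exact psychophysical procedure is not described in this summary; purely as an illustrative sketch, a detection threshold like this is often estimated with an adaptive staircase over the latency, for example as below (the simulated observer, step size, and starting latency are all assumptions):

```python
import random

def simulated_observer(latency_ms, jnd_ms=60.0):
    """Hypothetical stand-in for a participant: more likely to report the
    visuo-haptic asynchrony as the latency grows (jnd_ms is an assumption)."""
    p_notice = min(1.0, latency_ms / (2.0 * jnd_ms))
    return random.random() < p_notice

def staircase_threshold(start_ms=150.0, step_ms=10.0, reversals_needed=8):
    """1-up/2-down staircase over the visuo-haptic latency (illustrative only):
    two detections in a row lower the latency, a miss raises it; the mean
    latency at the reversal points estimates the ~70.7% detection threshold."""
    latency, streak, last_dir, reversals = start_ms, 0, None, []
    while len(reversals) < reversals_needed:
        if simulated_observer(latency):
            streak += 1
            if streak < 2:
                continue
            streak, direction = 0, -1
            latency = max(0.0, latency - step_ms)
        else:
            streak, direction = 0, +1
            latency += step_ms
        if last_dir is not None and direction != last_dir:
            reversals.append(latency)
        last_dir = direction
    return sum(reversals) / len(reversals)

print(f"Estimated latency threshold: {staircase_threshold():.1f} ms")
```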

3- Unlabelled Generation and Segmentation of Surgical Instruments [Code]

Figure showing the results of our proposed algorithm. (Top row) Original images. (Bottom row) Our segmentation results. Our method adopts a joint generation-and-segmentation strategy to learn a segmentation model that generalizes better to domains with no labelled data. The method leverages labelled data available in a different domain: the generator translates images from the labelled domain to the unlabelled domain while, simultaneously, the segmentation model learns from the generated data and regularizes the generative model.
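As a minimal sketch of the joint training idea (not the project's actual architecture or losses), one iteration could look roughly like this, with the generator, segmenter, and adversarial term reduced to illustrative stand-ins:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Illustrative stand-ins; the real generator/segmenter architectures and loss
# weights are not given in this summary.
G = nn.Conv2d(3, 3, 3, padding=1)   # "generator": labelled-domain image -> unlabelled-domain style
S = nn.Conv2d(3, 2, 3, padding=1)   # "segmenter": image -> 2-class instrument mask logits
opt = torch.optim.Adam(list(G.parameters()) + list(S.parameters()), lr=1e-4)

def train_step(x_labelled, mask, x_unlabelled, adv_weight=0.1):
    """One joint step: translate labelled images toward the unlabelled domain,
    supervise the segmenter on the translated images (their labels still hold),
    and let an adversarial/domain term regularize the generator (stubbed here)."""
    fake = G(x_labelled)                           # domain translation
    seg_loss = F.cross_entropy(S(fake), mask)      # supervised segmentation on generated data
    adv_loss = torch.zeros(())                     # placeholder for a GAN loss using x_unlabelled
    loss = seg_loss + adv_weight * adv_loss
    opt.zero_grad(); loss.backward(); opt.step()
    return loss.item()

# Example shapes: batch of 4 RGB images, 64x64, with integer instrument masks.
x_l = torch.rand(4, 3, 64, 64)
y_l = torch.randint(0, 2, (4, 64, 64))
x_u = torch.rand(4, 3, 64, 64)
print(train_step(x_l, y_l, x_u))
```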

4- Real-time Surgical Augmented Reality (AR) System

The surgeon's view through the da Vinci surgical endoscope. (Top) Image shows a prostate phantom (in blue) without AR visualization. (Bottom) Image shows the prostate phantom with the AR overlay. A patient's MRI is deformably registered to the phantom. A 3D mesh (pink) generated from the MRI and the MRI plane are visible in the image. The transverse plane of the MRI is controlled using the surgical instrument; notice how the MRI plane changes with the instrument. Tumor locations (red) can be seen at many positions on the MRI plane. For details please click on the title of the video.
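How the instrument-controlled MRI plane might be selected is not detailed here; a toy sketch of the idea, assuming the instrument tip has already been transformed into MRI coordinates (all names and values are illustrative), is:

```python
import numpy as np

def transverse_slice(mri_volume, tip_mri_mm, voxel_size_mm):
    """Return the transverse (axial) MRI slice at the instrument tip.

    mri_volume: (Z, Y, X) array; tip_mri_mm: instrument-tip position already
    transformed into MRI coordinates (mm); voxel_size_mm: (dz, dy, dx).
    All names are illustrative; the real system renders a textured plane
    inside the AR scene rather than returning a 2D array.
    """
    z_idx = int(round(tip_mri_mm[2] / voxel_size_mm[0]))
    z_idx = int(np.clip(z_idx, 0, mri_volume.shape[0] - 1))
    return mri_volume[z_idx]
```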

5- Evaluation of Complete AR System

Our AR guidance system transforms a patient's MRI data into the endoscope image frame. Since MRI is not real-time, we first register it to real-time ultrasound (US). Then, the US-to-endoscope-camera transformation is estimated using a chain of rigid-body transformations. We estimated the registration error from the US frame to the camera image frame using a water-bath experiment. (A) Custom grid visible in the endoscope camera image. (B) The cross-wire points visible in the sagittal US image. (C) Full water-bath setup; the positions of the US probe (labelled TRUS for transrectal ultrasound) and the endoscope can be seen in the image. For details please click on the title of the video.
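The calibration links themselves are not given in this summary, but composing a chain of rigid-body transforms amounts to multiplying 4x4 homogeneous matrices; a toy sketch with made-up link names and values:

```python
import numpy as np

def rigid(R, t):
    """Build a 4x4 homogeneous transform from a 3x3 rotation and a translation."""
    T = np.eye(4)
    T[:3, :3], T[:3, 3] = R, t
    return T

# Hypothetical link names and values; the actual calibration chain
# (e.g., TRUS -> tracker/robot -> endoscope camera) is not given here.
T_cam_from_tracker = rigid(np.eye(3), [0.0, 0.0, 50.0])
T_tracker_from_trus = rigid(np.eye(3), [10.0, -5.0, 0.0])

# Composing the chain maps a point from TRUS (US) coordinates into the camera frame.
T_cam_from_trus = T_cam_from_tracker @ T_tracker_from_trus

p_trus = np.array([1.0, 2.0, 3.0, 1.0])   # homogeneous point in US coordinates (mm)
p_cam = T_cam_from_trus @ p_trus
print(p_cam[:3])
```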

6- Color Depth Encoding for Reliable Depth Estimation

Video showing our color depth encoding (CDE) technique. (Top) Normal endoscope camera image. (Bottom) Our CDE visualization with the tumor (the small blob that changes color) and the virtual surgical instrument. The color of the tumor changes from red to blue as the distance between the surgical instrument and the tumor increases. For details please click on the title of the video.
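One simple way to realize such a mapping (the actual color ramp and distance range used in the project are not specified here) is a linear blend from red to blue over an assumed distance range:

```python
def depth_color(distance_mm, near_mm=5.0, far_mm=50.0):
    """Map instrument-to-tumor distance to an RGB color: red when close,
    blue when far, linearly blended in between. The distance range is an
    assumed example; the project's actual mapping is not specified here."""
    t = min(max((distance_mm - near_mm) / (far_mm - near_mm), 0.0), 1.0)
    return (1.0 - t, 0.0, t)   # (R, G, B) in [0, 1]

print(depth_color(5.0))    # (1.0, 0.0, 0.0) -> red, instrument close to tumor
print(depth_color(50.0))   # (0.0, 0.0, 1.0) -> blue, instrument far away
```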

7- Motion Parallax for Better Depth Estimation

Video showing our preliminary experiment testing the motion parallax (MP) depth cue for surgical applications. MP is a strong, active depth cue in the near field, and its study in surgical applications is important, especially for the stereo-impaired clinician population. (Top) Monocular view without MP. (Bottom) The same scene with MP. The green dot is an arbitrary point on the surface of the scene and the gray blob is a simulated tumor. Unlike the monocular view on the top, the 3D structure is clear with MP. We studied the combination of MP with the stereo depth cue in a within-subjects experiment. The results are promising and are currently being explored further.
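The reason MP conveys 3D structure is that, as the viewpoint translates, nearer points move farther across the image than distant ones; a toy pinhole-camera calculation (all values illustrative) makes this concrete:

```python
import numpy as np

def image_x(point_xyz, cam_x, focal_px=800.0):
    """Pinhole projection of a point's horizontal image coordinate for a
    camera translated laterally by cam_x (toy model, values illustrative)."""
    x, _, z = point_xyz
    return focal_px * (x - cam_x) / z

near_pt = np.array([0.0, 0.0, 0.3])   # 30 cm away
far_pt = np.array([0.0, 0.0, 1.5])    # 1.5 m away
shift = 0.02                          # 2 cm lateral head/camera motion

for name, p in [("near", near_pt), ("far", far_pt)]:
    motion = image_x(p, shift) - image_x(p, 0.0)
    print(f"{name} point image motion: {motion:.1f} px")
# The near point sweeps about 5x farther across the image than the far point;
# this differential image motion is the cue motion parallax provides.
```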