In this project, I developed a system that combines deep learning-based image segmentation with reinforcement learning to automatically navigate to regions of interest in ultrasound images. This approach has potential applications in medical imaging and robotic ultrasound guidance.
The project consists of three main components: a segmentation model that identifies regions of interest, a center-finding step that converts each mask into a navigation target, and a reinforcement learning agent that navigates to that target.
For the segmentation task, I used a U-Net architecture with a ResNet18 backbone. This model was trained on a dataset of abdominal ultrasound images to identify regions of interest.
Example of segmentation results on an abdominal ultrasound image. The top left is the original cropped ultrasound image, and the top right shows the predicted mask (mask_cropped). Combining the two gives the overlay image (overlaid_cropped), where the red overlay marks the segmented region of interest.
import torch
import torch.nn as nn
from torchvision import models


class SimpleResNetUNet(nn.Module):
    def __init__(self, in_channels=1, out_channels=1):
        super(SimpleResNetUNet, self).__init__()
        # Load a pre-trained ResNet18 model
        self.resnet = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
        # Replace the first layer so the network accepts grayscale images
        self.resnet.conv1 = nn.Conv2d(in_channels, 64, kernel_size=7, stride=2, padding=3, bias=False)

        # Encoder (ResNet18 layers)
        self.encoder1 = nn.Sequential(self.resnet.conv1, self.resnet.bn1, self.resnet.relu)  # 64 channels
        self.pool1 = self.resnet.maxpool
        self.encoder2 = self.resnet.layer1  # 64 channels
        self.encoder3 = self.resnet.layer2  # 128 channels
        self.encoder4 = self.resnet.layer3  # 256 channels
        self.encoder5 = self.resnet.layer4  # 512 channels

        # Decoder
        self.upconv5 = nn.ConvTranspose2d(512, 256, kernel_size=2, stride=2)
        self.decoder5 = self._make_decoder_block(512, 256)
        self.upconv4 = nn.ConvTranspose2d(256, 128, kernel_size=2, stride=2)
        self.decoder4 = self._make_decoder_block(256, 128)
        self.upconv3 = nn.ConvTranspose2d(128, 64, kernel_size=2, stride=2)
        self.decoder3 = self._make_decoder_block(128, 64)
        self.upconv2 = nn.ConvTranspose2d(64, 32, kernel_size=2, stride=2)
        # After concatenating with e1 (64 channels), the input is 32 + 64 = 96 channels
        self.decoder2 = self._make_decoder_block(96, 32)
        self.upconv1 = nn.ConvTranspose2d(32, 16, kernel_size=2, stride=2)
        self.decoder1 = self._make_decoder_block(16, 16)

        # Final output layer
        self.conv_final = nn.Conv2d(16, out_channels, kernel_size=1)

    def _make_decoder_block(self, in_channels, out_channels):
        return nn.Sequential(
            nn.Conv2d(in_channels, out_channels, kernel_size=3, padding=1),
            nn.BatchNorm2d(out_channels),
            nn.ReLU(inplace=True),
            nn.Conv2d(out_channels, out_channels, kernel_size=3, padding=1),
            nn.BatchNorm2d(out_channels),
            nn.ReLU(inplace=True)
        )

    def forward(self, x):
        # Encoder
        e1 = self.encoder1(x)   # 64 channels, 1/2 resolution
        p1 = self.pool1(e1)     # 64 channels, 1/4 resolution
        e2 = self.encoder2(p1)  # 64 channels, 1/4 resolution
        e3 = self.encoder3(e2)  # 128 channels, 1/8 resolution
        e4 = self.encoder4(e3)  # 256 channels, 1/16 resolution
        e5 = self.encoder5(e4)  # 512 channels, 1/32 resolution

        # Decoder with skip connections
        d5 = self.upconv5(e5)            # 256 channels, 1/16 resolution
        d5 = torch.cat((d5, e4), dim=1)  # 512 channels, 1/16 resolution
        d5 = self.decoder5(d5)           # 256 channels, 1/16 resolution
        d4 = self.upconv4(d5)            # 128 channels, 1/8 resolution
        d4 = torch.cat((d4, e3), dim=1)  # 256 channels, 1/8 resolution
        d4 = self.decoder4(d4)           # 128 channels, 1/8 resolution
        d3 = self.upconv3(d4)            # 64 channels, 1/4 resolution
        d3 = torch.cat((d3, e2), dim=1)  # 128 channels, 1/4 resolution
        d3 = self.decoder3(d3)           # 64 channels, 1/4 resolution
        d2 = self.upconv2(d3)            # 32 channels, 1/2 resolution
        d2 = torch.cat((d2, e1), dim=1)  # 96 channels, 1/2 resolution
        d2 = self.decoder2(d2)           # 32 channels, 1/2 resolution
        d1 = self.upconv1(d2)            # 16 channels, original resolution
        d1 = self.decoder1(d1)           # 16 channels, original resolution
        out = self.conv_final(d1)        # out_channels, original resolution
        return torch.sigmoid(out)
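As a quick sanity check (not part of the original training pipeline), the model can be run on a dummy grayscale image; the input sides should be divisible by 32 so the skip connections line up:

model = SimpleResNetUNet(in_channels=1, out_channels=1)
model.eval()
x = torch.randn(1, 1, 224, 224)  # one grayscale image; 224 is divisible by 32
with torch.no_grad():
    y = model(x)
print(y.shape)  # torch.Size([1, 1, 224, 224]) -- per-pixel probabilities in [0, 1]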
The model was trained using a combination of binary cross-entropy and Dice loss to ensure accurate segmentation of the regions of interest.
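The exact weighting between the two terms isn't given above; a common formulation of the combined loss, assuming the model's sigmoid outputs, a soft Dice term, and an illustrative 50/50 weighting, looks like this:

import torch.nn.functional as F

def bce_dice_loss(pred, target, bce_weight=0.5, eps=1e-6):
    # pred: sigmoid probabilities from the model; target: binary mask as floats
    bce = F.binary_cross_entropy(pred, target)
    intersection = (pred * target).sum()
    dice = (2.0 * intersection + eps) / (pred.sum() + target.sum() + eps)
    return bce_weight * bce + (1.0 - bce_weight) * (1.0 - dice)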
After segmentation, I implemented an algorithm to find the centers of the segmented regions:
import numpy as np


def find_center(mask):
    """
    Find the center of the segmented region.

    Args:
        mask: Binary mask of the segmented region
    Returns:
        center: (x, y) coordinates of the center
    """
    if np.sum(mask) > 0:
        # Centroid of all foreground pixels
        y_indices, x_indices = np.where(mask)
        center_x = int(np.mean(x_indices))
        center_y = int(np.mean(y_indices))
        return (center_x, center_y)
    else:
        # If no segmentation is found, fall back to the center of the image
        return (mask.shape[1] // 2, mask.shape[0] // 2)
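To illustrate with a toy mask (the coordinates here are hypothetical):

mask = np.zeros((128, 128), dtype=bool)
mask[40:60, 70:90] = True  # a 20x20 foreground blob
print(find_center(mask))   # (79, 49): mean column and mean row of the blob
print(find_center(np.zeros((128, 128), dtype=bool)))  # (64, 64): image-center fallback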
For the navigation task, I implemented a DQN agent with experience replay to learn how to navigate to the centers of the segmented regions. The agent was trained to move a viewing window across the ultrasound image to find the region of interest.
The environment simulates a moving ultrasound probe that can navigate across the image:
class AbdomenNavigationEnv:
    def __init__(self, image_files, centers_dict, view_size=(64, 64), max_steps=100, history_length=5):
        """
        Initialize the environment.

        Args:
            image_files: List of image file paths
            centers_dict: Dictionary mapping image filenames to center coordinates
            view_size: Size of the agent's view window
            max_steps: Maximum number of steps per episode
            history_length: Number of previous positions to track for oscillation detection
        """
        self.image_files = image_files
        self.centers_dict = centers_dict
        self.view_size = view_size
        self.max_steps = max_steps
        self.history_length = history_length

        # Action space: 0=left, 1=up, 2=right, 3=down, 4=stay
        self.num_actions = 5

        # Movement parameters
        self.move_step = 20  # Pixels to move per action

        # Current state variables
        self.current_image = None
        self.current_image_path = None
        self.current_center = None
        self.current_position = None
        self.current_view = None
        self.steps_taken = 0

        # Position history for oscillation detection
        self.position_history = []

        # Velocity for momentum
        self.velocity = (0, 0)
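The agent's Q-network and replay buffer aren't shown above; a minimal sketch of what they might look like, assuming the 64x64 view window and five actions from the environment (the layer sizes here are illustrative, not the original hyperparameters):

import random
from collections import deque

import torch
import torch.nn as nn

class QNetwork(nn.Module):
    # Hypothetical Q-network: maps a 64x64 grayscale view to Q-values for the 5 actions
    def __init__(self, num_actions=5):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=8, stride=4), nn.ReLU(),   # 64 -> 15
            nn.Conv2d(16, 32, kernel_size=4, stride=2), nn.ReLU(),  # 15 -> 6
            nn.Flatten(),
            nn.Linear(32 * 6 * 6, 128), nn.ReLU(),
            nn.Linear(128, num_actions),
        )

    def forward(self, x):
        return self.net(x)

class ReplayBuffer:
    # Fixed-size experience replay: store transitions, sample random minibatches
    def __init__(self, capacity=10000):
        self.buffer = deque(maxlen=capacity)

    def push(self, state, action, reward, next_state, done):
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        return random.sample(self.buffer, batch_size)

    def __len__(self):
        return len(self.buffer)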
One of the key challenges in this project was addressing the oscillating behavior of the RL agent. I implemented several improvements to reduce oscillations:
def _calculate_reward(self, distance):
    """
    Calculate the reward based on the distance to the center.

    Args:
        distance: Euclidean distance to the center
    Returns:
        reward: Reward value
    """
    # Smooth distance-based penalty (quadratic)
    distance_reward = -0.001 * (distance ** 2)

    # Bonus for being very close to the center, with a smoother gradient
    if distance < 5:
        distance_reward += 20.0
    elif distance < 10:
        distance_reward += 10.0 * (1 - (distance - 5) / 5)    # Decays from 10 to 0 as distance goes 5 -> 10
    elif distance < 20:
        distance_reward += 5.0 * (1 - (distance - 10) / 10)   # Decays from 5 to 0 as distance goes 10 -> 20

    # Penalize oscillations
    oscillation_penalty = 0
    if self._is_oscillating():
        oscillation_penalty = -5.0  # Significant penalty for oscillating

    # Reward for moving toward the center
    progress_reward = 0
    if len(self.position_history) > 1:
        prev_distance = np.sqrt(
            (self.position_history[-2][0] - self.current_center[0]) ** 2 +
            (self.position_history[-2][1] - self.current_center[1]) ** 2
        )
        # Positive when the agent got closer to the center since the last step
        progress_reward = (prev_distance - distance) * 0.5

    # Small step penalty to encourage efficiency
    step_penalty = -0.05

    return distance_reward + step_penalty + oscillation_penalty + progress_reward
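The _is_oscillating helper referenced above isn't shown; one plausible implementation, assuming positions are stored as (x, y) tuples, flags the agent when it keeps revisiting recent positions:

def _is_oscillating(self):
    # Heuristic: if the last few positions contain repeats, the agent is
    # bouncing between the same spots rather than making progress
    if len(self.position_history) < self.history_length:
        return False
    recent = self.position_history[-self.history_length:]
    revisits = len(recent) - len(set(recent))
    return revisits >= 2  # two or more repeated positions in the window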
The trained agent successfully navigates to the centers of the segmented regions with a high success rate. The improvements to address oscillations significantly enhanced the agent's performance.
Training metrics showing the agent's performance improvement over time. The graphs show episode rewards, episode lengths, final distances to target, oscillation counts, training loss, and reward moving average.
Here's an example of the agent navigating to a segmented region:
Visualization of the agent's navigation during training (episode 500). The agent learns to navigate efficiently to the target center.
Visualization of the agent's navigation during evaluation. The agent successfully navigates to the target center with minimal oscillations.
There is a significant gap between the image quality seen during training and what a moving ultrasound probe encounters in the real world, and several factors affect the model's performance in such scenarios.
When tested on new data with different image characteristics, the model sometimes fails to properly identify the target regions. These failure cases highlight the importance of diverse training data that captures the full range of conditions encountered in practice.
The sequence and quality of images used during training significantly impact the model's performance.
For robust real-world performance, the model needs to balance accuracy on the training distribution against generalization to the different image characteristics encountered in practice.
This project demonstrates the potential of combining deep learning-based image segmentation with reinforcement learning for automated navigation in medical imaging. The approach could be extended to real-world applications such as robotic ultrasound guidance, where a robot could automatically position an ultrasound probe to capture images of specific anatomical structures.
Future work could include closing this gap with more diverse training data and testing the system with a physical ultrasound probe.
The complete code for this project is available on GitHub. Feel free to explore the code, try it out, and adapt it for your own projects.
Have questions, suggestions, or feedback? I'd love to hear from you!