Unmanned Orchard Robot
Vision-Based Autonomous Guidance and Yield Monitoring
1. Overview
This project introduces a ROS-based autonomous robot designed for modernizing orchard management. By leveraging computer vision and SLAM, the robot can navigate orchard rows, monitor fruit growth status, and detect diseases in real-time. This system aims to solve critical challenges in precision agriculture, such as labor shortages and the need for timely data-driven interventions. Our key achievement was the development of a fully integrated platform that successfully performed these tasks in a complex environment, ultimately winning the Grand Prize at the Agricultural Robot Competition hosted by the Rural Development Administration of Korea.
2. The Challenge: Precision Agriculture in Orchards
Orchard environments pose unique challenges for automation.
- GNSS-Denied Environment: Dense canopies block GPS signals, making standard navigation methods unreliable.
- Unstructured Terrain: Irregular row spacing and scattered obstacles require robust perception and dynamic path planning.
- Variable Conditions: Fluctuating light and weather conditions demand a vision system that is resilient to change.
Our goal was to build a cost-effective robot that could reliably operate under these constraints using primarily vision and LiDAR sensors.
3. System Architecture & Hardware
3.1 System Overview (Hardware & Software Stack)
The robot is built on a modular hardware and software architecture to ensure flexibility and robustness.
- Hardware Stack:
- Chassis: TurtleBot3 Burger
- Single Board Computer: NVIDIA Jetson Nano
- Primary Sensor (SLAM): RPLiDAR A2M8 2D LiDAR
- Vision Sensors: 2 x Logitech C270 Webcams
- Controller: OpenCR 1.0 with Dynamixel motors
- Software Stack:
- OS: Ubuntu 18.04
- Framework: Robot Operating System (ROS1) Melodic
- Key Libraries:
GMappingfor SLAM,PyTorchfor deep learning,OpenCVfor image processing.
3.2 Hardware Iterations (Three Stages)
Throughout development, the robot’s hardware was iteratively upgraded—primarily adjusting the camera type, position, and orientation to achieve more accurate fruit counting.
- Stage 1 – Initial Side-View Prototype: Two low-cost webcams mounted on the left and right sides of the chassis, providing only side-facing views of tree rows. This configuration was adequate for basic navigation tests but missed central fruits and therefore produced incomplete fruit-count statistics.
- Stage 2 – Field Prototype: Dual stereo webcams angled downward and wider FOV; enabled depth estimation and better fruit localization while upgrading compute to Jetson Nano.
- Stage 3 – Competition-Ready: High-resolution cameras re-mounted higher with optimized tilt for canopy coverage; refined LiDAR placement, ruggedized enclosure, and full sensor calibration for precise, consistent fruit counting in real orchards.
4. Dataset Collection & Preparation Strategy
To develop a robust fruit detection system, we implemented a systematic and comprehensive dataset preparation strategy that emphasized diversity and bias reduction.
4.1 Initial Laboratory Dataset (793 images)
We first created a controlled dataset in our laboratory environment to establish baseline performance:
- Fruit Combinations: Healthy fruits (0-5), Diseased fruits (0-2)
- Distance Variations: 3 different camera-to-tree distances
- Angular Coverage: 10 different angles per setup
- Position Adjustments: Complete fruit repositioning for 2x variation
- Total Systematic Combinations: 6×3×3×10×2 = 1,080 planned images
4.2 Bias Reduction Strategies
To address potential data biases, we implemented several corrective measures:
1) Scale Bias Correction: Added close-up fruit images to counteract the bias toward small-scale fruits
- Healthy fruits: 2, 1, 0 configurations
- Diseased fruits: 4, 3, 2, 1, 0 configurations
- 2 images per configuration: 3×5×2 = 30 additional images
2) Angular Diversity: 180-degree rotation captures with repositioning to handle non-frontal detection scenarios
3) Environmental Adaptation: Systematic variation in lighting conditions and camera settings
Final Lab Dataset: 786 images with rich diversity and minimal bias
4.3 Competition Environment Adaptation (345 images)
- Transfer Learning Strategy: Used lab dataset for pre-training, then fine-tuned with competition environment data
- Camera-Specific Tuning: Adapted to actual camera specifications, lighting, and field conditions
- Validation: Lab-only model (best.pt) outperformed mixed-data model, confirming our diversity-first approach
5. Model Development & Optimization
5.1 Model Selection & Evolution
Model Evolution Process:
- Initial Model: YOLOv5s with pretrained COCO weights
- Optimization: Transitioned to YOLOv5n for Jetson Nano compatibility
- Transfer Learning: Leveraged ImageNet and COCO pretrained weights over random initialization
- Edge Optimization: Model specifically tuned for real-time inference on edge hardware
5.2 Training Strategy
- Epochs: 200 epochs with best model checkpoint saving
- Early Stopping: Disabled to ensure complete training convergence
- Anchor Optimization: YOLOv5’s automatic anchor optimization using K-means and genetic algorithms
- Auto-Scaling: Automatic image size normalization for robust performance
5.3 Technical Insights
- Bounding Box Adaptation: Camera width changes required retraining due to target proportion variations
- Rotation Sensitivity: Simple rotation affected tree detection (vertical vs. horizontal orientation)
- Mosaic Augmentation: Enhanced model robustness through YOLOv5’s advanced data augmentation
6. Core Technologies
6.1 Autonomous Navigation with SLAM
To navigate without GPS, the robot uses the GMapping SLAM algorithm. The 2D LiDAR sensor scans the environment to build a map of tree trunks and other obstacles. This map, combined with wheel odometry data from the Dynamixel motors, allows the robot to accurately determine its position and navigate autonomously along the orchard rows.
6.2 Real-time Fruit Detection & Classification
Our YOLOv5n model performs real-time detection with the following specifications:
- Classes: Tree, Healthy Fruit, Diseased Fruit
- Accuracy: 97% on test dataset
- Inference Speed: Optimized for Jetson Nano real-time processing
- Robustness: Handles various lighting conditions and viewing angles
7. Results & Competition Success
The final integrated system was tested in a mock orchard environment. The robot successfully navigated the rows, detected all target trees, and created a position map of healthy and diseased fruits. The project’s success was recognized with the Grand Prize at the 60th-anniversary Agricultural Robot Competition.
Key Achievements:
- Successful autonomous navigation in GPS-denied environment
- 97% accuracy in fruit and disease detection with systematic dataset preparation
- Real-time processing on edge computing platform (Jetson Nano)
- Robust performance under various lighting and weather conditions
- Advanced data preparation strategy with bias reduction techniques
- Grand Prize winner at national agricultural robotics competition
8. Technical Contributions & Lessons Learned
Data Science Contributions:
- Systematic dataset design methodology emphasizing diversity over quantity
- Bias identification and mitigation strategies for agricultural computer vision
- Transfer learning optimization for domain-specific applications
- Edge computing model optimization maintaining high accuracy
Engineering Insights:
- Camera-specific retraining necessity for bounding box accuracy
- Importance of angular diversity in agricultural object detection
- Edge hardware constraints driving model architecture decisions
- Integration challenges between SLAM navigation and computer vision systems
9. Conclusion & Future Work
This project successfully demonstrated the feasibility of a low-cost, vision-based robot for orchard automation with a particular emphasis on rigorous data preparation and model optimization for edge computing environments.
Technical Contributions:
- Integration of SLAM navigation with computer vision for precision agriculture
- Edge-optimized deep learning model with systematic dataset preparation
- Robust system design for challenging outdoor environments
- Comprehensive bias reduction methodology for agricultural AI
Potential next steps include:
- Testing and fine-tuning the system in real-world orchard environments
- Integrating robotic manipulation for automated harvesting based on detection results
- Improving long-term localization robustness with visual-inertial SLAM
- Expanding detection capabilities to multiple fruit varieties and disease types
- Scaling dataset preparation methodology for larger agricultural applications