🤓FAR-AVIO: Fast and Robust Acoustic-Visual-Inertial Odometry

Hao Wei, Peiji Wang, Qianhao Wang, Tong Qin, Fei Gao and Yulin Si

Zhejiang University, Shanghai Jiao Tong University


FAR-AVIO: Fast and Robust Schur-Complement Based Acoustic-Visual-Inertial Fusion Odometry with Sensor Calibration

Underwater environments impose severe challenges to visual–inertial odometry systems, as strong light attenuation, marine snow and turbidity, together with weakly exciting motions, degrade inertial observability and cause frequent tracking failures over long-term operation. While tightly coupled acoustic–visual–inertial fusion, typically implemented through an acoustic Doppler Velocity Log (DVL) integrated with visual–inertial measurements, can provide accurate state estimation, the associated graph-based optimization is often computationally prohibitive for real-time deployment on resource-constrained platforms. Here we present FAR-AVIO, a Schur-Complement based, tightly coupled acoustic-visual-inertial odometry framework tailored for underwater robots. FAR-AVIO embeds a Schur complement formulation into an Extended Kalman Filter(EKF), enabling joint pose–landmark optimization for accuracy while maintaining constant-time updates by efficiently marginalizing landmark states. On top of this backbone, we introduce Adaptive Weight Adjustment and Reliability Evaluation(AWARE), an online sensor health module that continuously assesses the reliability of visual, inertial and DVL measurements and adaptively regulates their sigma weights, and we develop an efficient online calibration scheme that jointly estimates DVL–IMU extrinsics, without dedicated calibration manoeuvres. Numerical simulations and real-world underwater experiments consistently show that FAR-AVIO outperforms state-of-the-art underwater SLAM baselines in both localization accuracy and computational efficiency, enabling robust operation on low-power embedded platforms. Our implementation has been released as open source software.

circle-check

1. System Running Example

  • Running Example with Log System on real-world underwater environments sequence (Acoustic-Visual-Inertial Mode), there was a continuous duration without any available visual features.

  • Meanwhile, the latest version of the code has integrated loop detection (the red line in the video represents AVIO output, and the green line represents the loop trajectory) and BA module, which can output high-precision sparse point clouds for visual reconstruction (eg, 3DGS).

  • AVIO with SLAM and reconstruction result, utilize the pose result from AVIO and the sparse point cloud provided by the backend SLAM module to perform offline mapping (pose prior BA and 3D GS rendering), we can get the following result.

  • Running Example with ROV in real world env

  • Running Environment and reconstruction result with out odom and slam module output

  • Running Example with Log System on Euroc V2_03_difficult sequence (Visual-Inertial Mode)

  • Running Example With Dense Reconstruction

  • Running Example with only FAR-VIDO

2. System Running Result

  • System Estimate Trajectory Result and Dense Reconstruction Result

  • System Architecture and DVL Measurement

3. Experiment Result

3. 1 Accuracy and Computational Load

3.2 Ablation Study

4. Multi-Camera/Omni Fisheye Camera Support

circle-check

Our latest version already supports multi-camera/omnidirectional fisheye camera, and also supports optical flow sensor observation updates. Please stay tuned!

5. Acknowledgments

Thanks to the researchers who have provided valuable suggestions for this work!

Last updated