1. Streaming Processing: Points arrive one by one from a data stream and are processed in real-time.
2. Sliding Window: Only the most recent Ο points are kept in memory, older points are forgotten.
3. Dynamic Histograms: Each tree is a multi-dimensional histogram that splits bins when they reach maximum height Ξ·.
4. Learning & Forgetting: New points increase bin heights and create splits, while old points decrease heights and merge bins.
5. Anomaly Scoring: Points that require fewer splits to isolate (shorter path depth) get higher anomaly scores.
6. Ensemble Decision: Multiple trees vote to provide robust anomaly detection.
Merge Trigger: When left_child.height + right_child.height < Ξ·
(max leaf samples)
Why Merging Happens:
Visual Cues: Watch the split colors - Yellow (stable) β Orange (about to merge) β Red (just merged)