Counting in three dimensions, not two.
Existing yield systems count in 2D and pay for it twice — fruit hidden behind leaves goes uncounted, and the same fruit gets re-counted across frames as the camera moves. Our edge is to reconstruct the canopy in 3D, so we see around occlusions, never double-count, and recover the metric that operators actually trade on: kilograms, not counts.
-
M.01
3D reconstruction of the canopy
A handheld or rail-mounted camera (an Insta360 works) sweeps each row weekly. We pair an off-the-shelf detector stack (YOLO + SAM) with structure-from-motion and 3D Gaussian Splatting to lift every row into a geometrically faithful, spatially registered 3D scene — each fruit a tracked instance with a position, a size, and a persistent identity across captures — building on recent work in 3D plant capture and photorealistic greenhouse reconstruction.
- → Multi-view: see fruit hidden behind leaves and other fruit
- → 3D identity: no double-counting across frames
- → True volume from geometry → weight via a per-cultivar size→mass model
-
M.02
Maturity staging from the canopy itself
Each tracked fruit is staged green → breaker → ripe from its 3D appearance. Knowing where each fruit sits in the ripening pipeline — not just how many there are — is what lets us forecast a harvest curve rather than a single number. Robustness across glazing, supplemental LEDs, and Mediterranean tunnel light is the open frontier we're investing in.
- → Per-fruit ripeness stage, not a per-row average
- → Census of every plant, not a sample of ten
- → Stable across glasshouses (Northern) and plastic tunnels (Southern)
-
M.03
Thermal-time forecasting
Tomato and pepper ripening is dominantly temperature-driven. We pair the per-fruit pipeline from M.01–M.02 with greenhouse climate data (temperature, daily light integral) to project when each cohort ripens and what it will weigh — kilograms harvestable per zone, per week, with uncertainty bands. Calibration is per-house and improves with every harvest cycle.
- → 1–3 week horizon — the length of the ripening pipeline
- → Disaggregated by zone for labour and logistics planning
- → Continuously recalibrated against the operator's actual harvest logs
The 3D capture is what makes everything downstream tractable: a weekly geometric record of the operation is a state representation in the modern ML sense — the kind of input recent computer-vision and forecasting research has learned to consume directly. Sized cohorts and ripening stages drive the harvest forecast. Geometry-and-colour anomalies can feed stress and disease detectors. Plant-level response to a fertigation change becomes attributable instead of averaged out. Pair that with the operator's existing climate and soil sensors and the result is a closed loop: measure → predict → act → re-measure.
TABLE 01 · Design targets vs. manual sampling
| Manual sampling | Earth Models (target) |
| Coverage | ~ 10 plants / block | Census of every plant we walk |
| Volume / weight | Sparse sample, extrapolated | Per-fruit, from 3D geometry (target ± 5 %) |
| Cadence | Weekly, sparse | Weekly, exhaustive |
| Forecast horizon | Gut-feel | 1 – 3 weeks, calibrated |