Object detection
Bounding-box and key-point detection with multi-object tracking across frames.
Six disciplines, one accountable team. Data engineering feeds classical machine learning, deep learning carries the workloads that won't fit hand-crafted features, computer vision and NLP put models against real inputs, and MLOps keeps the system honest in production.
DISCIPLINE MAP
Each discipline carries its own playbook, its own engineers and its own production history. Jump in at any layer; the team speaks across all of them.
The work that decides whether a model is worth training at all.
Classical supervised and unsupervised modelling, still the right tool for most real problems.
Neural architectures from ANN to Transformers. When the pattern is too dense for hand-crafted features.
Pixels to decisions: detection, segmentation, OCR, medical imaging.
Text and speech as first-class inputs — classification, embeddings, assistants, speech I/O.
Deployment, drift, edge, bias and explainability. Where the system earns trust or loses it.
DATA ENGINEERING
No model outperforms its data. We treat collection, cleaning, labelling, feature engineering and warehousing as engineering disciplines in their own right, versioned with the training code they feed.
Scrapers, partner APIs, streaming ingest, batch ETL. The data that reaches the warehouse is the data that trains everything downstream.
Outlier detection, missing-value imputation, unit harmonisation, de-duplication. Model quality ceilings are set here.
Annotation pipelines with human review, inter-annotator agreement, golden sets. Labels that survive the audit.
Feature stores, selection under leakage constraints, transformations versioned with training code.
Lakehouse design, governance tables, retention rules, residency. The layer that stays when models come and go.
For pipelines, lakehouses and the long-lived data layer behind AI systems, the dedicated discipline page has the full scope.
MACHINE LEARNING
For most business problems the right answer is a well-built gradient-boosted model with clean features, not the largest transformer in the lab. We still do the transformer work; we just don't start there.
DEEP LEARNING
We pick architectures on task shape, not vintage. Feed-forward nets still land; convolution still owns images at scale; recurrent state still beats attention on some signals; transformers dominate the rest. Fine-tuning is the default before full training.
Dense layers, back-prop. Still the simplest baseline when tabular structure is lost.
Translation-invariant feature detectors for images, spectrograms and structured 2D signals.
State across time for text, audio and telemetry. Superseded but not obsolete.
Self-attention over tokens. The substrate for modern language, vision and multi-modal models.
Distributions learnt well enough to sample from. Images, speech, molecules, code.
Pretrained weights plus targeted training. The pragmatic default when budget and data are finite.
COMPUTER VISION
Vision systems that go to production need more than a strong backbone. Dataset design, calibration, latency budgets and domain-specific review loops are where the release stands or falls.
Bounding-box and key-point detection with multi-object tracking across frames.
Semantic and instance segmentation for autonomy, inspection and medical domains.
Recognition, verification and affect analysis under privacy-aware configuration.
Denoising, super-resolution, deblur, restoration for low-signal input.
Layout parsing, table extraction, handwriting recognition, structured document output.
MRI, CT, X-ray, histopathology pipelines with radiologist-in-the-loop review.
Dataset design, calibration, inspection pipelines and medical-imaging review loops live on the dedicated computer vision page.
NATURAL LANGUAGE
Most NLP "wins" collapse on messy, multilingual, domain-specific input. We tune encoders on in-domain data, pair LLMs with retrieval instead of trusting open-book inference, and ship speech with diarisation and noise guards.
Review mining, intent detection, moderation queues — fine-tuned encoders outperform zero-shot at scale.
Word, sentence and domain-tuned embeddings. The retrieval layer underneath modern assistants.
Closed-book LLM, retrieval-grounded QA, multi-turn dialogue with tool use and guardrails.
Domain-tuned MT, long-form abstractive summarisation, style-preserving rewriting.
Streaming transcription, speaker diarisation, neural voice synthesis for product and accessibility surfaces.
OPERATIONS
Deployment is the easy half. Monitoring, drift detection, on-device compression, bias audit, privacy engineering and explainability are the layer that turns a demo into an accountable production system.
BRANCHES
Three pages live today; three are scoped and on the near roadmap. Each carries its own playbook, partners and production history.
Pipelines, lakehouses and the data layer that makes AI reproducible and compliant.
Open page → Branch 02 · LiveClassical ML and deep learning, training pipelines, evaluation suites, model registries.
Open page → Branch 03 · LiveDetection, segmentation, OCR, medical imaging and video analytics pipelines.
Open page → Branch 04 · LiveArchitecture selection, distributed training, generative systems, fine-tuning and inference optimisation.
Open page → Branch 05 · LiveDomain-tuned encoders, retrieval-grounded assistants, translation, speech I/O and safety engineering.
Open page → Branch 06 · LiveDeployment, drift detection, inference optimisation, bias audit, XAI and privacy engineering.
Open page →Bring a problem. We come back with a feasibility note, cost envelope and first-stage plan inside ten working days. No discovery theatre, no capability decks, no model-name bingo.