Capability stacks
Tools we use, picked for the job.
We're tool-agnostic but opinionated. Selection is driven by use case, data gravity, latency, total cost of ownership and team operability — never by vendor preference.
01
Data Architecture & Engineering
Reliable, scalable, governed data platforms — built on modern lakehouse and warehousing patterns with the right ingestion, transformation and orchestration tooling for the job.
Lakehouse & Warehouse
- Databricks
- Snowflake
- BigQuery
- Redshift
- Azure Synapse
- Delta Lake
Ingestion & Streaming
- Fivetran
- Apache NiFi
- Kinesis Firehose
- Apache Kafka
- Azure Data Factory
- AWS Glue
Transformation & Orchestration
- dbt
- Apache Spark
- Matillion
- Airflow
- Dask
- Azure Blob
02
AI, ML & Analytics
From descriptive to predictive and prescriptive — the right technique and processing method per use case, across Generative AI, classical ML, statistical modelling and MLOps.
Generative AI & LLM
- OpenAI
- Azure OpenAI
- LangChain
- Hugging Face
- Vector DBs
- RAG
ML Frameworks
- PyTorch
- TensorFlow
- scikit-learn
- XGBoost
- MLflow
- Vertex AI
Languages & Compute
- Python
- R
- Scala
- Azure Data Lake
- Databricks ML
- SageMaker
03
Visualisation & BI
Visual reporting and visual analysis — dashboards that snapshot performance and interactive analytics that let users explore data, ask new questions and act on insight.
Enterprise BI
- Power BI
- Tableau
- Looker
- Qlik Sense
- MicroStrategy
Embedded & Web
- Looker Studio
- Apache Superset
- Metabase
- D3.js
- ECharts
Semantic & Modelling
- dbt Semantic
- Cube
- LookML
- Power BI Datasets
04
Application Engineering
Custom application development and cloud data migration — enterprise solutions architecture, DevOps, SOA and information architecture across Microsoft Azure and AWS.
Web & Backend
- Node.js
- Java
- Spring
- .NET
- Angular
- React
Mobile
- Flutter
- iOS / Swift
- Android / Kotlin
- React Native
DevOps & Cloud
- AWS
- Azure
- GCP
- Docker
- Kubernetes
- Terraform
05
Big Data Technologies
Distributed processing and high-throughput pipelines for petabyte-scale workloads — batch, micro-batch and real-time.
Processing
- Apache Spark
- Hadoop
- Flink
- Beam
- Presto / Trino
Streaming
- Kafka
- Kinesis
- Pub/Sub
- Spark Streaming
Storage Formats
- Parquet
- Iceberg
- Delta
- Hudi
- Avro
06
Databases
Right-fit storage for the workload — relational, NoSQL, time-series, graph and search — selected for consistency, throughput and access pattern.
Relational
- PostgreSQL
- MySQL
- SQL Server
- Oracle
- Aurora
NoSQL & Cache
- MongoDB
- DynamoDB
- Cassandra
- Redis
- Cosmos DB
Search & Specialty
- OpenSearch
- Elasticsearch
- Neo4j
- InfluxDB
- Pinecone