Data - IMTI

Professional notes by Craig Johnston
long-form, short-form, working drafts · since 2008

VOL. XIX · MMXXVI
126 NOTES IN PRINT

Tag: Data

27 NOTES

The Agent Gateway to Your Platform: mcp-data-platform

An Apache-2.0 MCP server that gives frontier models access to, and grounded context for, everything you built

KUBERNETES · FOSS · PLATFORM · MCP

Self-Hosted BI with Apache Superset

Dashboards and SQL exploration over the lakehouse and Postgres, a Tableau replacement on the platform's SSO

KUBERNETES · FOSS · PLATFORM · SUPERSET

Self-Hosted Integration with Apache NiFi

Visual dataflow and ETL on your own cluster, instead of Workato and the iPaaS vendors

KUBERNETES · FOSS · PLATFORM · NIFI

Your Own Snowflake: A Trino and Iceberg Lakehouse

SQL analytics over open table formats on the object storage you own

KUBERNETES · FOSS · PLATFORM · TRINO

Self-Hosted S3 with SeaweedFS

S3-compatible object storage for the data lake, backups, and the lakehouse

KUBERNETES · FOSS · PLATFORM · STORAGE

A Search Layer You Own: OpenSearch on Kubernetes

The index for the platform, on the Apache-licensed fork of Elasticsearch

KUBERNETES · FOSS · PLATFORM · OPENSEARCH

Kafka Without ZooKeeper: KRaft on Kubernetes with Strimzi

A real-time event backbone, without the separate ZooKeeper ensemble to operate

KUBERNETES · FOSS · PLATFORM · KAFKA

A Database You Own: Postgres with CloudNativePG

High-availability PostgreSQL on your own storage, without the managed bill

KUBERNETES · FOSS · PLATFORM · POSTGRESQL

Where Tribal Knowledge Goes

An insight an analyst shares mid-session becomes a reviewed, signed-off change to the catalog rather than vanishing when the conversation ends.

MCP · AGENTS · DATA · AI

One Process, Many MCP Servers

Compose mcp-trino, mcp-datahub, and mcp-s3 as Go libraries behind one endpoint, instead of running three servers that can't see each other.

MCP · GO · AGENTS · DATA

Build a Better Platform Than You're Renting

Cloud-grade data platforms from FOSS on Kubernetes, now that agents run the hard part

KUBERNETES · DATA · AI · FOSS

MCP Is Flawed. Build With It Anyway.

Context has always been the hard problem. MCP forces you to solve it.

MCP · AI · DATA · ARCHITECTURE

AI Data Lake Access with MCP and S3

Building composable MCP servers for object storage

AI Data Warehouse Access with MCP and Trino

Building composable MCP servers for enterprise data

Advanced Platform Development with Kubernetes

Enabling Data Management, the Internet of Things, Blockchain, and Machine Learning

KUBERNETES · DATA · MACHINE LEARNING · BLOCKCHAIN

Kafka on Kubernetes

Deploy a highly available Kafka cluster on Kubernetes.

KAFKA · KUBERNETES · DATA

Elasticsearch Essential Queries

Getting started with Elasticsearch

ELASTICSEARCH · DATA

Remote Query Elasticsearch on Kubernetes

Local workstation-based microservices development

KUBERNETES · ELASTICSEARCH · DATA

High Traffic JSON Data into Elasticsearch on Kubernetes

Instant, reliable, send and forget.

KUBERNETES · ELASTICSEARCH · DATA · JSON

Kibana on Kubernetes

Visualize your Elasticsearch data.

KUBERNETES · ELASTICSEARCH · DATA · KIBANA

Production Grade Elasticsearch on Kubernetes

Set up a fast, custom, production-grade Elasticsearch cluster.

KUBERNETES · ELASTICSEARCH · DATA

Python Data Essentials - Matplotlib and Seaborn

A beginner's guide.

PYTHON · DATA · DATA SCIENCE · DATA VISUALIZATION

Webpage to PDF Microservice

Automate PDF Report Generation

MICROSERVICE · PDF · DATA · DOCKER

Python Data Essentials - Pandas

A data type equivalent to super-charged spreadsheets.

PYTHON · DATA · DATA SCIENCE

Python Data Essentials - Numpy

Powerful N-dimensional array objects.

PYTHON · DATA · DATA SCIENCE

SQL Foundations

Selects, joins and aliases.

SQL · DATA · DATABASE

Don't Install cqlsh

Containers as utility applications

CASSANDRA · DOCKER · DATA