Built by experts. Powered by research.
Palimio combines deep expertise in social media strategy, data engineering, and applied machine learning into a single end-to-end intelligence platform. Every component is purpose-built.
Our team brings backgrounds in data science, NLP research, and content strategy for some of the largest media brands in the world. The platform reflects years of domain expertise distilled into automated, scalable pipelines.
System Architecture
Data Sources
Extensive historical data warehouse + live real-time feeds
Processing
Feature Engineering
Content Tagging
Normalisation
ML / Analytics
Embeddings
UMAP + HDBSCAN
Correlation Engine
Intelligence
Shift Detection
Gap Analysis
Strategic Synthesis
Presentation
Dashboard
PDF Reports
API
Our data engineering pipeline
Stage 01
Automated Data Ingestion
You enter your social media handles. We do the rest.
Enter your social media handles and the competitors you want to track. Our pipeline takes over from there, ingesting every post, normalising metadata across platforms, and storing it in a structured data warehouse optimised for analytical queries.
Under the hood
- Daily ingest. New posts are ingested continuously
- Cross-platform schema normalisation (timestamps, engagement metrics, content types)
- Scalable cloud data warehouse with partitioned tables for fast analytical reads
- Multiple gigabytes of data ingested, processed, and made query-ready
Stage 02
Content Analysis & Feature Engineering
Turning raw posts into structured creative signals.
Every post passes through a multi-stage feature engineering pipeline. Our proprietary models analyse the content itself (not just the metadata), extracting six distinct creative dimensions that define the 'DNA' of each piece of content. This goes far beyond hashtag counting or keyword matching.
Under the hood
- Six engineered dimensions: format, topic, emotional tone, opening hook, scene composition, production style
- Multi-modal analysis pipeline (video, image, caption, audio and metadata signals combined)
- Hierarchical classification with confidence scoring and edge-case handling
- Continuous model evaluation against human-labelled validation sets
Stage 03
Semantic Embedding & Clustering
Mapping content in high-dimensional space.
Each post is projected into a high-dimensional vector space based on its content profile. We then apply dimensionality reduction (UMAP) and density-based clustering (HDBSCAN) at two granularity levels to reveal the natural theme structure of your content library, and your competitors'.
Under the hood
- Dense vector embeddings capturing semantic similarity between posts
- UMAP projection for interpretable 2D visualisation (Content Galaxy)
- Two-tier clustering: macro themes and micro niches detected automatically
- Stored cluster centroids enable real-time assignment of new posts without re-clustering
Stage 04
Competitor Benchmarking Engine
Fair, normalised, dimension-level comparison.
Competitor data flows through the same analysis pipeline as your own, ensuring a true apples-to-apples comparison. Our benchmarking engine normalises sample sizes, computes per-feature performance deltas, and identifies the specific creative gaps where competitors outperform you, ranked by impact.
Under the hood
- Sample-size normalisation: primary account volume matched to competitor scrape depth
- Per-dimension performance breakdown. The actual video, image, and carousel components are compared and matched to what your competitors are producing. We find EXACTLY what's working and where the gaps are.
- Content Gap algorithm: ranks content by all dimensions and features, anchored against median views and engagement delta across you vs. competitor pool
- Per-competitor filtering so you can isolate any individual rival's strategy
Stage 05
Intelligence & Strategic Analysis
From patterns to prescriptions.
The processed data is fed through a structured intelligence layer that synthesises trends, detects performance shifts, and generates strategic recommendations grounded in your actual numbers. Every insight is citation-backed. No generic advice, no hallucinated statistics.
Under the hood
- Time-series shift detection with configurable sensitivity thresholds
- Engagement correlation analysis across all metric combinations
- Structured output: 'What's Happening' narrative + 'Recommendations' with named targets
- Executive summary generation that covers performance, production mix, cadence, and competitive positioning
Stage 06
Interactive Dashboard & Reporting
Research-grade analysis, consumer-grade interface.
Everything arrives in a real-time dashboard designed for strategists, not statisticians. Every chart is interactive, every breakdown is filterable, and the full report exports to a pixel-perfect A4 PDF in one click. The interface is built to surface insight fast. No training required.
Under the hood
- Server-rendered Next.js application with sub-second filter switching
- Per-platform intelligence with engagement formulas tuned to each platform's mechanics
- One-click PDF report with AI executive summary, KPIs, top posts, and competitor snapshot
Want to go deeper?
We're happy to walk through the methodology, model architecture, and data handling practices in detail.
Talk to us