Web Fingerprint
23 checks — foundation of every audit
Fingerprint detects technologies, CMS, e-commerce platforms, hosting, SSL certificate, sitemap, keywords and generates vector embeddings for semantic comparison. Available from the BASIC plan.
CMS & E-commerce
WordPress, Joomla, Drupal, Ghost, Wix, Squarespace, Webflow, Typo3, Nette, Laravel, Django, HubSpot, Stranka.sk, Webnode, Blogger.
Shoptet, PrestaShop, WooCommerce, Magento, OpenCart, Webareal, Shopify, Shoper, Upgates.
Heuristic classification: e-shop, marketplace, blog, forum, social network, aggregator, news portal, wiki, portfolio, catalogue, booking, SaaS, streaming.
Tech Stack
JS: jQuery, React, Vue.js, Angular, Alpine.js, HTMX, Turbo, Stimulus, Svelte (with versions). CSS: Bootstrap, Tailwind, Bulma, Foundation. CDN: Cloudflare, CloudFront, Akamai, Fastly, jsDelivr.
Google Analytics, GTM, Facebook Pixel, Hotjar, Heureka, Sklik, Criteo, Google Ads, SmartSupp, Biano, Luigi's Box, CookieYes.
Cloudflare, Fastly, Akamai, CloudFront, Google CDN.
Payments & Fonts
GoPay, Stripe, PayPal, Comgate, Tatrapay, Sporopay, CardPay, Cash on Delivery, Bank Transfer.
Google Fonts (with family extraction), Adobe Fonts (Typekit), Font Awesome, Custom WOFF/WOFF2.
Server & SSL
Nginx, Apache, LiteSpeed, IIS, Tomcat + version + release year. Reverse proxy: Varnish, BigIP, HAProxy, Envoy, Traefik.
Facebook, Instagram, Twitter/X, LinkedIn, YouTube, TikTok, Pinterest — URL links.
Checks whether the domain uses HTTPS with a valid certificate.
Identifies the certificate issuer (Let's Encrypt, DigiCert, Sectigo, ...).
Checks the expiration date and the number of days remaining until expiry.
Sitemap & Content
Checks whether the website has an accessible sitemap.xml or sitemap index.
Counts the URLs in the sitemap — used as the basis for tier recommendation.
Validates the format, URL correctness, and accessibility of the referenced pages.
Content & AI Enrichment
Strips HTML tags, scripts, and styles — produces a clean text representation of the page.
Basic content length metric for the analysed page.
Automatic extraction from URL paths, H1, title, meta, breadcrumbs, category tree, headings. Scoring: weight x log2(frequency + 1) x log2(product_count + 2).
Assigns keywords to categories (product, service, location, brand).
BGE-M3 model via OpenRouter — 1024-dim vectors stored in pgvector for semantic comparison.
Cosine similarity search across the embedding database — finds websites with similar content.
Tech Stack Modernity
Evaluates how modern the detected technologies are and the adoption of best practices. Output: tier (Legacy / Standard / Modern / Cutting-edge) + score 0-100. Composed of two equally weighted categories (50/50):

Total score calculation
Modernity Score = Tech Stack (50%) + Best Practices (50%)Each category is normalized to 0-100 and the result is their average.
Tech Stack (50% of total score)
The best detected technology is counted for each category. Max raw points: 43 (normalized to 0-100).
Best Practices (50% of total score)
Points for implementing modern web practices. Penalties for outdated HTML patterns.