Site logo

Drifting Paper Boat Lab

MetaThief - A Tool for Quickly Getting Website Meta Information

MetaThief - A Tool for Quickly Getting Website Meta Information

Open Source
SEOMetadataAPIDeveloper

Just enter a link to quickly get all meta information of a website, also supports API access.

What is this?

MetaThief is a fast tool for retrieving website meta information. Just enter a URL to extract all metadata, with an API interface for easy integration into other applications.

Why I Built This?

Sometimes I collect websites and organize them into navigation-like pages for easy viewing and management. But the problem is, I need to manually search for meta information from each website's source code, which is time-consuming and labor-intensive.

Coincidentally, when I first started learning about SEO, I also learned that this meta information can be used to optimize search engine rankings, which gave me the motivation to complete this tool. So I started building this tool, initially with only the simplest features, such as extracting basic metadata and Open Graph tags. As I used this tool more and more, I added more features, such as extracting icons, robots.txt, and other information.

Core Features

  • Metadata Retrieval - Supports basic metadata and Open Graph tags
  • Icon Extraction - Get website favicon, icons in multiple sizes
  • robots.txt Crawling - One-click access to site crawler rules
  • API Support - RESTful API for programmatic access

Supported Meta Tags

CategoryTags
Basiclanguage, charset, viewport, title, description, keywords, author, generator
Linksfavicon, canonical, icons, theme
Open GraphogUrl, ogTitle, ogSiteName, ogDescription, ogImage, ogImageAlt, ogType
TwittertwitterSite, twitterCard, twitterTitle, twitterCreator, twitterDescription, twitterImage
Otherrobots

What Problem Does It Solve?

When organizing navigation sites, analyzing competitor websites, or doing SEO optimization, manually searching for meta information in source code is inefficient. MetaThief automatically extracts all metadata, saving time and effort.

API Usage

Get all metadata:

GET /api/meta?url=https://example.com

Get specific meta tags:

GET /api/meta?url=https://example.com&meta=title,description,favicon

Response example:

{
  "title": "Example Website",
  "description": "Website description",
  "favicon": "https://example.com/favicon.ico",
  "icons": [
    "https://example.com/favicon.ico",
    "https://example.com/apple-touch-icon.png"
  ]
}