Published on 2025-08-07T06:18:08Z

Embedly

Embedly is a content fetching service that helps web and app developers create rich, interactive previews of URLs. It is not a traditional web crawler but an on-demand fetcher; it only visits a specific URL when a user shares that link on a platform that uses Embedly's service. For website owners, Embedly enhances how your content appears when shared, which can lead to higher engagement and click-through rates.

What is Embedly?

Embedly is a set of front-end developer tools that functions as a middleware service for generating rich content embeds and previews. It is a content fetcher, not a traditional web crawler, meaning it only accesses URLs that are explicitly provided by an end user through a client application. When making a request, it identifies itself with the user-agent string Mozilla/5.0 (compatible; Embedly/0.2; +http://support.embed.ly/). It supports over 200 content providers and uses its API to return standardized metadata for creating previews.
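As a minimal sketch of how a site might recognize these requests in its own logs or middleware, the check below looks for the Embedly product token from the user-agent string quoted above. The function name and the loose version pattern are illustrative assumptions, and a match only shows what the request claims to be, since the header is trivially spoofed.

```python
import re

# Product token taken from the user-agent string quoted above. The version
# number is matched loosely because it may change (an assumption).
EMBEDLY_TOKEN = re.compile(r"\bEmbedly/\d+(?:\.\d+)*", re.IGNORECASE)


def claims_to_be_embedly(user_agent: str) -> bool:
    """Return True if the User-Agent header contains an Embedly product token.

    This only inspects the header text; it does not prove who sent the request.
    """
    return bool(EMBEDLY_TOKEN.search(user_agent or ""))


if __name__ == "__main__":
    ua = "Mozilla/5.0 (compatible; Embedly/0.2; +http://support.embed.ly/)"
    print(claims_to_be_embedly(ua))
```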

Why is Embedly on my site?

Embedly is accessing your website because a user has shared a link to your content on a platform that uses Embedly's services (e.g., a social media site or a content management system). It does not proactively crawl your site. Instead, a specific user action triggers it to fetch metadata such as titles, descriptions, and thumbnails, which it uses to generate a rich preview for the shared link. The frequency of visits depends entirely on how often your content is shared on these platforms.

What is the purpose of Embedly?

The core purpose of Embedly is to improve the user experience on other platforms by transforming plain URLs into rich, interactive content previews. When a link is shared, Embedly retrieves information about the URL, checks it for malware, extracts content, and returns standardized metadata that the embedding platform can use. For website owners, this service provides a significant benefit by making your shared links more visually appealing and interactive, which can drive more traffic to your site and improve user engagement with your content.

How do I block Embedly?

If you wish to prevent Embedly from generating previews of your content, you can block it using your robots.txt file. Note that doing so will cause links to your site to appear as plain text URLs when shared on platforms that use this service.

To block Embedly, add the following lines to your robots.txt file:

User-agent: Embedly
Disallow: /
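Before deploying the rule, it can help to confirm that it parses the way you expect. The sketch below feeds the two lines above to Python's standard urllib.robotparser and asks whether the Embedly token may fetch a page. The example.com URL and the SomeOtherBot name are placeholders, and the result reflects how this parser reads the file, not how Embedly's own fetcher behaves.

```python
from urllib.robotparser import RobotFileParser

# The rule from the section above, as it would appear in robots.txt.
ROBOTS_TXT = """\
User-agent: Embedly
Disallow: /
"""


def is_allowed(robots_txt: str, agent: str, url: str) -> bool:
    """Parse robots.txt text and report whether `agent` may fetch `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    page = "https://example.com/article"  # placeholder URL
    print(is_allowed(ROBOTS_TXT, "Embedly", page))       # False: blocked
    print(is_allowed(ROBOTS_TXT, "SomeOtherBot", page))  # True: not affected
```

Pass the short product token ("Embedly") rather than the full user-agent string, because this parser matches on the token before the first slash.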

How do I verify the authenticity of a user-agent operated by Embedly?

Reverse IP lookup technique

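To verify that a request really comes from the claimed operator, run the Linux host command twice, starting with the IP address of the requester.
  1. > host IPAddressOfRequest
    This command performs a reverse DNS lookup and returns the hostname that the IP address points to (for example, querying 8.8.4.4 returns "4.4.8.8.in-addr.arpa domain name pointer dns.google.", so the hostname is dns.google).
  2. > host ReverseDNSFromTheOutputOfFirstRequest
    This command performs a forward DNS lookup on that hostname and returns its IP addresses.
If the output of the second command includes the original IP address and the hostname belongs to a domain associated with a trusted operator (e.g., Embedly), the user-agent can be considered legitimate.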
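The two lookups above can be automated. The following is a minimal sketch using Python's socket module, not an official Embedly procedure. The function name and the trusted_suffixes parameter are assumptions: this article does not name a hostname domain for Embedly's fetchers, so you would have to supply one you have confirmed yourself. The resolver arguments exist only so the logic can be exercised without network access.

```python
import socket


def forward_confirmed_rdns(ip, trusted_suffixes,
                           reverse=socket.gethostbyaddr,
                           forward=socket.gethostbyname_ex):
    """Return True if `ip` passes a reverse-then-forward DNS check.

    Step 1: reverse lookup of the IP to get a hostname.
    Step 2: forward lookup of that hostname to get its addresses.
    The check passes only if the original IP is among those addresses and
    the hostname falls under one of `trusted_suffixes` (caller-supplied).
    Note: gethostbyname_ex resolves IPv4 addresses only.
    """
    try:
        hostname = reverse(ip)[0].rstrip(".").lower()
        addresses = forward(hostname)[2]
    except OSError:
        # Covers socket.herror and socket.gaierror (no PTR record, no A record).
        return False

    if ip not in addresses:
        return False

    suffixes = [s.lower().lstrip(".") for s in trusted_suffixes]
    return any(hostname == s or hostname.endswith("." + s) for s in suffixes)
```

A call such as forward_confirmed_rdns(request_ip, ["example.net"]) would use live DNS. Here example.net is a stand-in for whatever domain you have established as belonging to the operator.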

IP list lookup technique

Some operators provide a public list of IP addresses used by their crawlers. This list can be cross-referenced to verify a user-agent's authenticity. However, both operators and website owners may find it challenging to maintain an up-to-date list, so use this method with caution and in conjunction with other verification techniques.
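If an operator does publish address ranges, the cross-reference can be sketched with Python's ipaddress module as shown below. The ranges listed are reserved documentation networks used purely as placeholders, not addresses belonging to Embedly, and the function name is illustrative. In practice the list would be loaded from the operator's published source and refreshed regularly.

```python
import ipaddress

# Placeholder ranges (reserved for documentation), NOT real Embedly addresses.
PUBLISHED_RANGES = ["192.0.2.0/24", "198.51.100.0/24"]


def ip_in_ranges(ip: str, ranges) -> bool:
    """Return True if `ip` falls inside any CIDR range in `ranges`."""
    try:
        address = ipaddress.ip_address(ip)
    except ValueError:
        # Not a valid IPv4 or IPv6 address.
        return False
    return any(address in ipaddress.ip_network(cidr) for cidr in ranges)


if __name__ == "__main__":
    print(ip_in_ranges("192.0.2.44", PUBLISHED_RANGES))   # inside the first range
    print(ip_in_ranges("203.0.113.9", PUBLISHED_RANGES))  # outside both ranges
```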