About Brandfetch
Brandfetch is a leading source for structured brand data, providing logos, descriptions, brand profiles, and identity details for over 30 million brands worldwide, powering experiences for developers and teams that rely on accurate brand data. To deliver that, Brandfetch needs to reliably extract structured information from modern websites. Brandfetch maintains a continuous refresh pipeline, often reprocessing brands on a roughly 30-day cycle to ensure they always surface the most up-to-date information.
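To make the refresh cycle concrete, here is a minimal sketch of how a pipeline could pick the brands that are due for reprocessing. It is an illustration only, not Brandfetch's actual implementation: the function name, the in-memory dictionary of timestamps, and the fixed 30-day interval are all assumptions.

```python
from datetime import datetime, timedelta, timezone

# Assumed interval, based on the "roughly 30-day cycle" described above.
REFRESH_INTERVAL = timedelta(days=30)


def brands_due_for_refresh(last_processed, now):
    """Return the domains last processed at least REFRESH_INTERVAL ago, oldest first.

    `last_processed` is a hypothetical mapping of brand domain to the
    timezone-aware datetime of its most recent processing run.
    """
    due = [
        (timestamp, domain)
        for domain, timestamp in last_processed.items()
        if now - timestamp >= REFRESH_INTERVAL
    ]
    # Sorting the (timestamp, domain) pairs puts the stalest brands first.
    return [domain for _, domain in sorted(due)]


if __name__ == "__main__":
    now = datetime(2024, 3, 1, tzinfo=timezone.utc)
    history = {
        "a.example": datetime(2024, 1, 1, tzinfo=timezone.utc),
        "b.example": datetime(2024, 2, 20, tzinfo=timezone.utc),
    }
    print(brands_due_for_refresh(history, now))
```

In a real pipeline the timestamps would more likely live in a database or queue, and processing the stalest brands first is just one possible ordering.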
The Challenge
Extracting reliable brand identity data from modern websites requires JavaScript rendering, full HTML capture, and screenshot analysis. Early on, Brandfetch attempted to run this workflow internally using AWS Lambda, but the setup quickly became unreliable:
- Lambda executions failed frequently under load
- The setup wasn’t resilient enough for their high request volume, and reliability issues compounded as volumes grew
- Maintaining a robust, scalable infrastructure would drain valuable engineering time
Brandfetch needed a low-latency, scalable, and dependable browser automation API that could handle millions of requests per month and simply return rendered webpages, without the operational overhead.
The Solution
From early on in building their crawling infrastructure, Brandfetch chose Browserless to handle their rendering workloads. Implementation was intentionally simple: they call Browserless via straightforward REST GET requests, with no complex orchestration or custom infrastructure to maintain.
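As a rough illustration of how little client code a GET-based integration needs, the sketch below builds a request URL and fetches rendered HTML using only the Python standard library. The endpoint is a placeholder, and the `url` and `token` query parameter names are assumptions made for this example; they are not taken from the Browserless API reference, so check the official documentation for the real routes and parameters.

```python
from urllib.parse import urlencode
from urllib.request import urlopen

# Placeholder endpoint: NOT a real Browserless URL.
RENDER_ENDPOINT = "https://rendering.example.com/content"


def build_render_url(target_url, token, endpoint=RENDER_ENDPOINT):
    """Build a GET URL asking the rendering service to load `target_url`.

    The `url` and `token` parameter names are hypothetical.
    """
    query = urlencode({"url": target_url, "token": token})
    return f"{endpoint}?{query}"


def fetch_rendered_html(target_url, token, timeout=30.0):
    """Send the GET request and return the response body as text."""
    with urlopen(build_render_url(target_url, token), timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


if __name__ == "__main__":
    # Only prints the URL; no network call is made here.
    print(build_render_url("https://example.com", "YOUR_TOKEN"))
```

Because the whole exchange is one HTTP request and one response, there is no browser session for the caller to open, track, or clean up.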
Brandfetch relies on Browserless when:
- A website requires JavaScript rendering
- They need accurate, fully rendered HTML
- Screenshots are necessary
Browserless enables them to extract brand assets by combining HTML and CSS data with screenshot-based comparisons.
The Results: Millions of Requests Per Month
With Browserless, Brandfetch has built a scalable, low-maintenance brand crawling system that keeps up with a constantly expanding dataset and modern, JavaScript-heavy websites. Today, Browserless powers a core part of Brandfetch’s data pipeline, enabling them to successfully make millions of requests per month.
These volumes run smoothly without requiring Brandfetch to manage flaky headless sessions or tune internal browser infrastructure. Instead, they focus on building the logic that makes their brand data uniquely valuable, while Browserless handles the messy infrastructure layer.
Speed, Reliability, and Day-to-Day Performance
Browserless’s speed and reliability were key reasons Brandfetch adopted the platform early on, and they remain central to why the team continues using it today. The solution has consistently delivered the performance needed at scale, powering the rendering and extraction workflows that run across millions of brand requests.
In daily production, Browserless plays a critical role in fetching fully rendered HTML, capturing screenshots, and enabling Brandfetch to extract brand assets with confidence. The API behaves predictably and performs quickly and reliably, making it a dependable part of their data pipeline.
Engineering Efficiency: A Smart Build vs Buy Decision
Brandfetch has engineers who could build an internal browser rendering system, but the team recognized that doing so would require substantial development time and effort, as well as long-term maintenance that would distract from core product priorities.
By choosing Browserless, Brandfetch avoided the infrastructure burden and kept their engineers focused on building value into their product rather than supporting complex headless browser operations.
Customer Support Experience
Fast, responsive support has been a major benefit throughout the partnership. From the early days of integration to ongoing operations, quick responses from the Browserless team gave Brandfetch confidence that they could rely on the platform for critical infrastructure. This remains a valuable component of the relationship.
What’s Next for Brandfetch
Brandfetch is actively adopting Browserless V2 with CAPTCHA handling and stealth improvements to further enhance coverage on more complex websites.
As web protections become more sophisticated, Brandfetch sees Browserless as a critical partner in maintaining and expanding their coverage, ensuring they continue to deliver fast, accurate, and always up-to-date brand data to their customers.
Summary
For more than five years, Browserless has been a foundational part of Brandfetch’s data pipeline, enabling them to:
- Render modern, JavaScript-heavy websites
- Extract brand assets with confidence
- Process millions of monthly requests and maintain a 30-day refresh cycle
- Avoid the overhead of running their own headless browser fleet
- Move faster thanks to an easy API and reliable performance
Browserless enables Brandfetch to stay focused on what matters: delivering the world’s most accurate, continuously updated brand identity data.
Looking to build a scalable web data pipeline?
Brandfetch’s experience shows how Browserless can reliably render modern websites and keep large datasets continuously up to date. If your product depends on accurate, fully rendered web data at scale, Browserless can help. Talk to our team to see how Browserless fits your workflow.
