WHAT TO FIX’s Meta Monetization Archive is the world’s only searchable archive of Meta’s partner-publisher disclosures, backed up since 2019.
Platforms:
Data source: https://www.facebook.com/brand_safety/publisher_lists (about the data)
Github:meta-monetization-archiveWHAT-TO-FIX • Updated Sep 11, 2025
Using the Archive
Wondering who may have made money from Facebook and Instagram? Explore the dashboard.
Available Filters:🔎 Keyword🔎 Monetization Program🔎 Page Language🔎 Primary Admin Location🔎 Date Range🔎 Follower Count
Note: The archive is released as is. With nearly 3 billion records, it can struggle a bit with complex and concurrent queries.
Found anything interesting? Do let us know. We would love to help amplify and feature your publications!
And please make sure to attribute the data to the WHAT TO FIX Meta Monetization Archive. It helps us make the case for continued maintenance!
Data Collection
Note: as of 2026, Meta is the only company releasing such data.
Data Processing
Did we make any changes to the data? We wish all we had to do was consolidate the files into a database. But the truth is, Meta’s raw data files were far from perfect, and we had to set up validation and correction steps.
We did it all computationally and published the code on GitHub, so that it can be independently audited.
Data Quality Issues
Inconsistent Releases
Contrary to its stated commitment, Meta does not release lists daily. We also failed to capture files on certain dates.
Solution: We defaulted to using the date of the last known file in lieu of a session’s end_date.
Limitation: Sessions’ end_date may be a few days earlier than the real end_date.
Solution: We defaulted to using the date of the last known file in lieu of a session’s end_date.
Limitation: Sessions’ end_date may be a few days earlier than the real end_date.
Data Standardization
We ran into different versions of the same language (e.g. Bengali/Bangla, Punjabi/Panjabi, Sinhalese/Sinhala) and country.
Solution: We standardized the fields as part of our data processing flow.
Limitation: Those fields may no longer match those from the Meta raw data.
Solution: We standardized the fields as part of our data processing flow.
Limitation: Those fields may no longer match those from the Meta raw data.
Missing Data
We encountered a number of entries which lacked an ID, making them impossible to process.
Solution: We disregarded these entries in the archive.
Limitation: The total number of records on impacted days may no longer match the Meta raw data.
Solution: We disregarded these entries in the archive.
Limitation: The total number of records on impacted days may no longer match the Meta raw data.
Missing Date_Added
Until 2020-10-04, the date_added field was framed as “new in the last 30 days” with the option of a “no” input.
Solution: We applied an algorithm to deduce missing date_added fields based on later data.
Limitation: For a limited number of early pages, which did not monetize for long, we lacked a specific date_added. If the account was marked as active for more than 30 days on 2019-10-25, we defaulted to a date_added of 2019-09-25.
Solution: We applied an algorithm to deduce missing date_added fields based on later data.
Limitation: For a limited number of early pages, which did not monetize for long, we lacked a specific date_added. If the account was marked as active for more than 30 days on 2019-10-25, we defaulted to a date_added of 2019-09-25.
Date_Added Errors
We faced a number of different types of errors with the date_added field, which is meant to reflect the latest date of onboarding. This included date_added predating the existence of the platform/program, date_added in the future, and date_added contradicting the record of inclusion in disclosures files.
Solution: we applied an algorithm to correct the various identified errors.
Limitation: the total number of records on impacted days may no longer match the raw data.
Solution: we applied an algorithm to correct the various identified errors.
Limitation: the total number of records on impacted days may no longer match the raw data.
Ghost Records
We encountered a number of records which, upon inspection, did not appear to be live on the platform at the time of their disclosure. These records shared a consistent pattern, with their account name (publisher), handle (username) and subscribers being empty.
Solution: we disregarded these entries in the archive.
Limitation: the total number of records on impacted days may no longer match the raw data.
Solution: we disregarded these entries in the archive.
Limitation: the total number of records on impacted days may no longer match the raw data.
Got a question or feedback? Notice anything that doesn’t look quite right?
Get in touch at meta-monetization-archive@whattofix.tech.
You can also sign up for our Monetization Watch newsletter for regular updates on monetization developments.
Monetization.wtf is maintained by WHAT TO FIX, with financial support from Luminate.
©️CC BY-ND 4.0 | Terms of use


