Studies using OpenWPM
======================

Data collected by WebTAP
------------------------

Since 2015, WebTAP has conducted a web census to study third-party online
tracking. Each month from 2015 to 2018, they visited the web’s 1 million most
popular sites using OpenWPM and recorded data pertaining to user privacy,
including cookies, fingerprinting scripts, the effect of browser privacy tools,
and the exchange of tracking data between different sites (“cookie syncing”).

WebTAP has `released `_ the entire Princeton Web Census data (about 15
terabytes), containing privacy measurements of 1 million sites conducted each
month from December 2015 to June 2018.

List of Studies that have used OpenWPM
---------------------------------------

.. list-table::
   :widths: 5 25 70
   :header-rows: 1

   * - Year
     - Venue
     - Study Name
   * - 2014
     - ACM CCS
     - `The Web Never Forgets: Persistent Tracking Mechanisms in the Wild `_
   * - 2014
     - ACM CoSN
     - `Cognitive disconnect: Understanding Facebook Connect login permissions `_
   * - 2015
     - WWW
     - `Cookies that give you away: The surveillance implications of web tracking `_
   * - 2015
     - NDSS
     - `Upgrading HTTPS in midair: HSTS and key pinning in practice `_
   * - 2015
     - Tech Science
     - `Web privacy census `_
   * - 2015
     - W2SP
     - `Variations in tracking in relation to geographic location `_
   * - 2016
     - IFIP AICT
     - `Evaluating Websites and Their Adherence to Data Protection Principles `_
   * - 2016
     - ACM CCS
     - `Online Tracking: A 1-million-site Measurement and Analysis `_
   * - 2016
     - WWW
     - `No honor among thieves: A large-scale analysis of malicious web shells `_
   * - 2017
     - NDSS
     - `Dial One for Scam: A Large-Scale Analysis of Technical Support Scams `_
   * - 2017
     - PETS
     - `Cross-Device Tracking: Measurement and Disclosures `_
   * - 2017
     - CODASPY
     - `Identifying HTTPS-Protected Netflix Videos in Real-Time `_
   * - 2017
     - WWW
     - `De-anonymizing Web Browsing Data with Social Networks `_ [#f1]_
   * - 2017
     - IWPE
     - `Battery Status Not Included: Assessing Privacy in Web Standards `_
   * - 2017
     - Annual Privacy Forum
     - `PrivacyScore: Improving Privacy and Security via Crowd-Sourced Benchmarks of Websites `_
   * - 2017
     - arXiv
     - `Horcrux: A Password Manager for Paranoids `_
   * - 2017
     - USENIX Security
     - `Measuring the Insecurity of Mobile Deep Links of Android `_
   * - 2017
     - Applied Economics Letters
     - `Online advertising networks and consumer perceptions of privacy `_
   * - 2018
     - PETS
     - `When the cookie meets the blockchain: Privacy risks of web payments via cryptocurrencies `_
   * - 2018
     - PETS
     - `I never signed up for this! Privacy implications of email tracking `_
   * - 2018
     - ACM TOIT
     - `Measuring third party tracker power across web and mobile `_
   * - 2018
     - CALIcon
     - `Third Party Trackers on Law School Library Websites `_
   * - 2018
     - Master Thesis, Delft University of Technology
     - `Tracking Cookies in the European Union, an Empirical Analysis of the Current Situation `_
   * - 2018
     - ACM CCS
     - `The Web’s Sixth Sense: A Study of Scripts Accessing Smartphone Sensors `_
   * - 2018
     - ACSAC
     - `Raising the Bar: Evaluating Origin-wide Security Manifests `_
   * - 2018
     - arXiv
     - `The Unwanted Sharing Economy: An Analysis of Cookie Syncing and User Transparency under GDPR `_
   * - 2018
     - PhD thesis, Princeton University
     - `Automated discovery of privacy violations on the web `_
   * - 2018
     - AINTEC’18
     - `Understanding abusive web resources: characteristics and counter-measures of malicious web resources and cryptocurrency mining `_
   * - 2018
     - SSRN
     - `Acquisitions in the Third Party Tracking Industry: Competition and Data Protection Aspects `_
   * - 2019
     - Communications in Computer and Information Science
     - `Transparency in Keyword Faceted Search: An Investigation on Google Shopping `_
   * - 2019
     - arXiv
     - `The Price of Free Illegal Live Streaming Services `_
   * - 2019
     - Advances in Intelligent Systems and Computing
     - `Usage of HTTPS by Municipal Websites in Portugal `_
   * - 2019
     - ConPro
     - `The Impact of User Location on Cookie Notices (Inside and Outside of the European Union) `_
   * - 2019
     - WWW
     - `Before and After GDPR: The Changes in Third Party Presence at Public and Private European Websites `_
   * - 2019
     - IEEE EuroS&P
     - `TraffickStop: Detecting and Measuring Illicit Traffic Monetization Through Large-Scale DNS Analysis `_
   * - 2019
     - SSRN
     - `The Market for Data Privacy `_
   * - 2019
     - ACM CSCW
     - `Dark Patterns at Scale: Findings from a Crawl of 11K Shopping Websites `_
   * - 2019
     - Computer Communications
     - `A comparison of web privacy protection techniques `_
   * - 2019
     - DPM
     - `On Privacy Risks of Public WiFi Captive Portals `_
   * - 2019
     - Computers & Security
     - `Towards a global perspective on web tracking `_
   * - 2019
     - APF
     - `Towards Transparency in Email Tracking `_
   * - 2019
     - RAID
     - `Talon: An Automated Framework for Cross-Device Tracking Detection `_
   * - 2019
     - ACM CCS
     - `Watching You Watch: The Tracking Ecosystem of Over-the-Top TV Streaming Devices `_
   * - 2019
     - ACM IMC
     - `Tales from the Porn: A Comprehensive Privacy Analysis of the Web Porn Ecosystem `_
   * - 2019
     - The New York Times
     - `I Visited 47 Sites. Hundreds of Trackers Followed Me. `_
   * - 2019
     - The Washington Post
     - `Think you’re anonymous online? A third of popular websites are ‘fingerprinting’ you. `_
   * - 2019
     - ESORICS
     - `Fingerprint surface-based detection of web bot detectors `_
   * - 2019
     - DPM
     - `A Study on Subject Data Access in Online Advertising after the GDPR `_
   * - 2019
     - IEEE SPW
     - `After GDPR, Still Tracking or Not? Understanding Opt-Out States for Online Behavioral Advertising `_
   * - 2020
     - PETS
     - `Missed by Filter Lists: Detecting Unknown Third-Party Trackers with Invisible Pixels `_
   * - 2020
     - PETS
     - `Inferring Tracker-Advertiser Relationships in the Online Advertising Ecosystem using Header Bidding `_
   * - 2020
     - PETS
     - `A Comparative Measurement Study of Web Tracking on Mobile and Desktop Environments `_
   * - 2020
     - PETS
     - `No boundaries: data exfiltration by third parties embedded on web pages `_
   * - 2020
     - PETS
     - `In-Depth Evaluation of Redirect Tracking and Link Usage `_
   * - 2020
     - The Web Conference
     - `The Representativeness of Automated Web Crawls as a Surrogate for Human Browsing `_ [#f2]_
   * - 2020
     - The Web Conference
     - `Apophanies or Epiphanies? How Crawlers Impact Our Understanding of the Web `_
   * - 2020
     - The Web Conference
     - `Stop Tracking me Bro! Differential Tracking of User Demographics on Hyper-partisan Websites `_ [#f2]_
   * - 2020
     - The Web Conference
     - `Beyond the Front Page: Measuring Third Party Dynamics in the Field `_
   * - 2020
     - ACM ASIACCS
     - `Measuring the Impact of the GDPR on Data Sharing in Ad Networks `_
   * - 2020
     - arXiv
     - `Actions speak louder than words: Semi-supervised learning for browser fingerprinting detection `_
   * - 2020
     - PAM
     - `Extortion or Expansion? An investigation into the costs and consequences of ICANN’s gTLD experiments `_
   * - 2020
     - Bachelor Thesis, Radboud University
     - `Design and implementation of a stealthy OpenWPM web scraper `_
   * - 2020
     - IWPE
     - `On Compliance of Cookie Purposes with the Purpose Specification Principle `_
   * - 2020
     - FTC PrivacyCon
     - `Unaccounted Privacy Violation: A Comparative Analysis of Persistent Identification of Users Across Social Contexts `_
   * - 2020
     - IEEE EuroS&P
     - `Multi-country Study of Third Party Trackers from Real Browser Histories `_
   * - 2020
     - TMA
     - `Characterizing CNAME Cloaking-Based Tracking on the Web `_
   * - 2020
     - TMA
     - `Clash of the Trackers: Measuring the Evolution of the Online Tracking Ecosystem `_
   * - 2020
     - WEIS
     - `The Impact of the GDPR on Content Providers `_
   * - 2020
     - PhD Thesis, University of Michigan
     - `Enhancing System Transparency, Trust, and Privacy with Internet Measurement `_
   * - 2020
     - Masters Thesis, Concordia University
     - `A Large-Scale Evaluation of Privacy Practices of Public WiFi Captive Portals `_
   * - 2020
     - IEEE Globecom
     - `A machine learning approach for detecting CNAME cloaking-based tracking on the Web `_
   * - 2021
     - NDSS
     - `Reining in the Web’s Inconsistencies with Site Policy `_
   * - 2021
     - PETS
     - `Unveiling Web Fingerprinting in the Wild Via Code Mining and Machine Learning `_
   * - 2021
     - IEEE S&P
     - `Fingerprinting the Fingerprinters: Learning to Detect Browser Fingerprinting Behaviors `_

.. rubric:: Footnotes

.. [#f1] Uses data released by us.
.. [#f2] Studies OpenWPM’s behavior.