The Chain of Implicit Trust

About this Study

The Web is a tangled mass of interconnected services, where websites import a range of external resources from various third-party domains. However, the latter can further load resources hosted on other domains. For each website, this creates a dependency chain underpinned by a form of implicit trust between the first-party and transitively connected third-parties. The chain can only be loosely controlled as first-party websites often have little, if any, visibility of where these resources are loaded from. This study (dataset is detailed in our paper ) performs a large-scale study of dependency chains in the Web, to find that around 50% of first-party websites render content that they did not directly load. Although the majority (84.91%) of websites have short dependency chains (below 3 levels), we find websites with dependency chains exceeding 30. Using VirusTotal, we show that 1.2% of these third-parties are classified as suspicious --- although seemingly small, this limited set of suspicious third-parties have remarkable reach into the wider ecosystem.

Paper and Dataset Download

Our paper has to appear in The Web Conference (WWW), May 2019.

A sample of dataset and scripts used in this paper is hosted at on Google Drive.

Contact Person

Please contact us on the following email for if you intend to use the dataset and scripts.

Muhammad Ikram: Muhammad.Ikram [at] mq.edu.au or engr.ikram [at] gmail.com

Collaboration

This is a collobrative work of:

Muhammad Ikram, Optus Macquarie University Cyber Security Hub, Macquarie University and Censored Planet, University of Michigan.
Rahat Masood UNSW and Data61-CSIRO.
Gareth Tyson, Queens Marry University of London
Mohamed Ali (Dali) Kaafar,Optus Macquarie University Cyber Security Hub, Macquarie University and Data61-CSIRO.
Noha Liozen, Data61-CSIRO.
Roya Ensafi, Censored Planet, University of Michigan.