Introducing privacy in current web search engines

Abstract : During the last few years, the technological progress in collecting, storing and processing a large quantity of data for a reasonable cost has raised serious privacy issues. Privacy concerns many areas, but is especially important in frequently used services like search engines (e.g., Google, Bing, Yahoo!). These services allow users to retrieve relevant content on the Internet by exploiting their personal data. In this context, developing solutions to enable users to use these services in a privacy-preserving way is becoming increasingly important. In this thesis, we introduce SimAttack an attack against existing protection mechanism to query search engines in a privacy-preserving way. This attack aims at retrieving the original user query. We show with this attack that three representative state-of-the-art solutions do not protect the user privacy in a satisfactory manner. We therefore develop PEAS a new protection mechanism that better protects the user privacy. This solution leverages two types of protection: hiding the user identity (with a succession of two nodes) and masking users' queries (by combining them with several fake queries). To generate realistic fake queries, PEAS exploits previous queries sent by the users in the system. Finally, we present mechanisms to identify sensitive queries. Our goal is to adapt existing protection mechanisms to protect sensitive queries only, and thus save user resources (e.g., CPU, RAM). We design two modules to identify sensitive queries. By deploying these modules on real protection mechanisms, we establish empirically that they dramatically improve the performance of the protection mechanisms.
Document type :
Theses
Complete list of metadatas

Cited literature [159 references]  Display  Hide  Download

https://hal.inria.fr/tel-01492488
Contributor : Abes Star <>
Submitted on : Wednesday, April 25, 2018 - 12:41:07 PM
Last modification on : Friday, May 17, 2019 - 10:21:53 AM
Long-term archiving on : Thursday, September 20, 2018 - 7:08:11 AM

File

these.pdf
Version validated by the jury (STAR)

Identifiers

  • HAL Id : tel-01492488, version 2

Citation

Albin Petit. Introducing privacy in current web search engines. Web. Université de Lyon; Universität Passau, 2017. English. ⟨NNT : 2017LYSEI016⟩. ⟨tel-01492488v2⟩

Share

Metrics

Record views

247

Files downloads

674