As fallout from the revelation of Cambridge Analyticas misuse of Facebook user information continues, many are mistakenly calling this incident a breach. Facebook is right to claim this incident was no breachthis is Facebooks platform working exactly as designed.
I know, because I too created a survey app on Facebook for the express purpose of harvesting users profile information. Except in my case, my academic research project was not engaged in deception and theft, and my co-authors and I used the data we gathered in part to understand both how much data we could obtainlegitimatelythrough the protocols made available to third-party app developers, and whether Facebook users understood what was occurring.
Facebook opened itself up to third-party developers back in 2007. Apps deployed on Facebook quickly became the rage, from Words with Friends to the many iterations of Farmville and the like.
There were no barriers to sign up as a Facebook developer; literally, anyone who could build an app could do it. Much like any online sign-up process, one needed only to agree to the sites developer terms and conditions and have enough technical proficiency to deploy an app. If you could convince other Facebook users to install it on their profiles, as a developer you were instantly rewarded with a trove of data that exceeded what was available on a users profile.
My colleagues and I did this in 2010, creating an app we called What kind of Facebook user are you? In our project approved by UC Berkeleys Institutional Review Board, we conducted a survey where we asked Facebook users questions about how third-party apps worked, the types of data apps could access from their profiles, and their privacy concerns. We published our findings in 2011.
The process for launching the app was trivial and provided us with not only with the users data, but also a smaller set of data from their entire social graph: all of their Facebook friends. It is this feature of the API (now discontinued) that allowed Cambridge researcher Dr. Aleksandr Kogan to vastly expand his dataset past the few hundred thousand who actually responded to his survey. For example, our 516 respondents had over 129,000 friends, whose basic profile information we also could have gathered.
Many of us in the privacy research community were aghast at the Facebook APIs leniency when it launched, and speculated that an incident like this was inevitable. As researchers who focus on privacy and social media, we wanted to understand whether Facebook users understood exactly what was happening when they elected to add an app to their profile. At the time we conducted our survey, about half of our respondents did not know that apps could access and store their profile data off of Facebooks site, or that when they added an app, it could also access the basic profile data of anyone on their friends list.
Further, we asked specifically about the kinds of practices Cambridge Analytica engaged in: over 90 percent of our respondents were somewhat or very uncomfortable with the possibility that an app could sell their profile information, store their information permanently on its own servers, or share that data with other companies.
The Facebook Platform API was notable for its obvious lack of concern for users data. Although the company eventually limited access to some profile data and removed the open access to the users social graphs, Facebook did not screen its third-party developers (and still does not). As this incident shows, the company cannot meaningfully police its developers. When one considers the substantial number of third-party apps offered across their platform, there is no practical way Facebook can ensure that there arent hundredsor even thousandsof additional Cambridge Analyticas lurking out there and engaging in similar behavior.
Herein lies the conflict: the companies that traffic in user data in the United States have worked hard to fight any legislative oversight. Instead, they promise to self-regulate, yet as this incident demonstrates, we cant trust them to enforce the terms of their own business agreements. To be fair, this problem is not limited to Facebook; a lack of oversight has repeatedly put Americans privacy at risk across the entire technology and information industry. Congress has repeatedly failed to pass legislation protecting Americans from the most egregious information collection practices.
Since we cant rely on Congress to act, at a minimum Facebook should either discontinue or drastically alter its developer program to ensure that only a bare minimum of user profile data is ever shared outside the company. Further, the company should engage in active oversight of any third parties that have access to its users data to ensure that anyone who seeks to engage with its users does so with benign intentions. Given that weve already seen how poorly the company has supervised its advertising sales operations, Im pessimistic they can accomplish this feat.
In its effort to quickly dominate the social networking landscape, Facebook put growth ahead of its users. Now that the inevitable has occurred, we see that it wasnt just Facebook users who were put in jeopardy. Rather, the health and future of our democracy is at stake.
Cross-posted from Medium.