Data Governance in the Digital Age | Data Minefield? How AI Is Prodding Governments to Rethink Trade in Data

Data Minefield? How AI Is Prodding Governments to Rethink Trade in Data

Published: April 3, 2018

Author: Susan Ariel Aaronson

Key Points

  • No nation alone can regulate artificial intelligence (AI) because it is built on cross-border data flows.
  • Countries are just beginning to figure out how best to use and to protect various types of data that are used in AI, whether proprietary, personal, public or metadata.
  • Countries could alter comparative advantage in data through various approaches to regulating data — for example, requiring companies to pay for personal data.
  • Canada should carefully monitor and integrate its domestic regulatory and trade strategies related to data utilized in AI.
M

any of the world’s leaders are focused on the opportunities presented by AI — the machines, systems or applications that can perform tasks that, until recently, could only be performed by a human. In September 2017, Russian President Vladimir Putin told Russian schoolchildren, “Whoever becomes the leader in this sphere will become the ruler of the world (Putin quoted in RT.com 2017). Many countries, including Canada, China, the United States and EU member states, are competing to both lead the development of AI and dominate markets for AI.1

Canadian Prime Minister Justin Trudeau had a different take on AI. Like Putin, he wants his country to play a leading role. At a 2017 event, he noted that Canada has often led in machine learning breakthroughs and stressed that his government would use generous funding and open-minded immigration policies to ensure that Canada remains a global epicentre of AI (Knight 2017). However, Trudeau had some caveats. While AI’s uses are “really, really exciting…it’s also leading us to places where maybe the computer can’t justify the decision (Trudeau quoted in Knight 2017). He posited that Canadian culture might offer the right guidance for the technology’s development: “I’m glad we’re having the discussion about AI here in a country where we have a charter of rights and freedoms; where we have a decent moral and ethical frame to think about these issues” (ibid.).

Canada alone cannot determine how AI is used because many applications and devices powered by AI depend on cross-border data flows to train them. In short, AI is a trade policy issue. The choices that nations make in governing AI will have huge implications for the digital economy, human rights and their nation’s future economic growth.

AI and Cross-border Data Flows

Every day, large amounts of data flows course through the internet, over borders and between individuals, firms and governments to power the internet and associated technologies. A growing portion of these data flows are used to fuel AI applications such as Siri, Waze and Google searches. Because many of these data flows are directly or indirectly associated with a commercial transaction, they are essentially traded. AI applications, “which use computational analysis of data to uncover patterns and draw inferences, depend on machine learning technologies that must ingest huge volumes of data, most often from a wide variety of sources” (BSA 2017). For example, when you ask a language translation app to translate where to find the best pommes frites in Paris, the app will rely on many other search queries from other apps, databases and additional sources of content. In another example, if you ask Watson, IBM’s AI-powered super computer2 to diagnose rare forms of cancer, it must first sift through some 20 million cancer research papers and draw meaningful conclusions by connecting various large data sets across multiple countries (Galeon and Houser 2016).

Canada alone cannot determine how AI is used because many applications and devices powered by AI depend on cross-border data flows to train them.

Not surprisingly, the average netizen is increasingly dependent on AI. A Northeastern University Gallup Poll survey of 3,297 US adults in 2017 found that 85 percent of Americans use at least one of six products with AI elements, such as navigation apps, music streaming services, digital personal assistants, ride-sharing aps, intelligent home personal assistants and smart home devices (Reinhart 2018). Some 79 percent of those polled said that AI has had a very or mostly positive impact on their lives so far (ibid.). However, most users probably do not know that trade agreements govern AI. Other polls reveal that if they did, they might call for stronger privacy requirements, better disclosure and a fuller national debate about how firms use algorithms and publicly generated data (CIGI-Ipsos 2017). 

The public needs such information to assess if these algorithms are being used unethically, used in a discriminatory manner (to favour certain types of people) or used to manipulate people — as was the case in recent elections (Hern 2017). Policy makers also need to better understand how companies and researchers use proprietary data, personal data, metadata (allegedly anonymized personal data) and public data to fuel AI so that they can develop effective regulation.

shutterstock_785985316 B&W-web.jpg
A growing portion of cross-border data flows are used to fuel AI applications such as Siri. Most users of products with AI elements are likely unaware that trade agreements govern AI. (Photo: Hadrian / Shutterstock.com)

The Current State of Trade Rules Governing AI  

Although the World Trade Organization (WTO) says nothing about data, data flows related to AI are governed by WTO rules drafted before the invention of the internet. Because this language was originally drafted to govern software and telecommunications services, it is implicit and out of date. Today, trade policy makers in Europe and North America are working to link AI to trade with explicit language in bilateral and regional trade agreements. They hope this union will yield three outputs: the free flow of information across borders to facilitate AI; access to large markets to help train AI systems; and the ability to limit cross-border data flows to protect citizens from potential harm consistent with the exceptions delineated under the General Agreement on Trade in Services. These exceptions allow policy makers to breach the rules governing trade in cross-border data to protect public health, public morals, privacy, national security or intellectual property, if such restrictions are necessary and proportionate and do not discriminate among WTO member states (Goldsmith and Wu 2006).

As of December 2017, only one international trade agreement, the Comprehensive and Progressive Agreement for Trans-Pacific Partnership (CPTPP), formerly the Trans-Pacific Partnership (TPP), includes explicit and binding language to govern the cross-border data flows that fuel AI. Specifically, the CPTPP (which is still being negotiated) includes provisions that make the free flow of data a default, requires that nations establish rules to protect the privacy of individuals and firms providing data (a privacy floor), bans data localization (requirements that data be produced or stored in local servers) and bans all parties from requiring firms to disclose source code. These rules reflect a shared view among the 11 parties: nations should not be allowed to demand proprietary information when facilitating cross-border data flows.3

The United States (which withdrew from the TPP) wants even more explicit language related to AI as it works with Mexico and Canada to renegotiate NAFTA.

The United States (which withdrew from the TPP) wants even more explicit language related to AI as it works with Mexico and Canada to renegotiate the North American Free Trade Agreement (NAFTA). The United States has proposed language that bans mandated disclosure of algorithms as well as source code (Office of the United States Trade Representative 2017). The United States wants to ensure that its firms will not be required to divulge their source code or algorithms even if the other NAFTA parties require such transparency to prevent firms from using such algorithms in a discriminatory manner, to spread disinformation or in ways that could undermine their citizens’ ability to make decisions regarding their personal information (autonomy). Hence, the United States is using trade rules to “protect” its comparative advantage in AI.

Like most trade agreements, the CPTPP and NAFTA also include exceptions, where governments can breach the rules delineated in these agreements to achieve legitimate domestic policy objectives. These objectives include rules to protect public morals, public order, public health, public safety and privacy related to data processing and dissemination. However, governments can only take advantage of the exceptions if they are necessary, performed in the least trade-distorting manner possible and do not impose restrictions on the transfer of information greater than what is needed to achieve that government’s objectives. Policy makers will need greater clarity about how and when they can take these steps to protect their citizens against misuse of algorithms.

AI Strategies, Domestic Regulation and Trade

Some states and regions are developing very clear and deliberate policies to advance AI both within and beyond their borders. China’s free trade agreements do not contain binding rules on data flows or language on algorithms. But the country uses the lure of its large population, relatively low and poorly enforced privacy regulations, and subsidies to encourage foreign companies to carry out AI research in China. At the same time, the United States seems to be using trade agreements to build beyond its 318 million people to achieve economies of scale and scope in data (Aaronson and LeBlond, forthcoming 2018).

However, the European Union seems to be taking the most balanced approach, recognizing that it cannot encourage AI without maintaining online trust among netizens that their personal data will be protected. The 28 (soon to be 27) member states of the European Union agreed4 to create a digital single market as a key part of their customs union.5 The European Commission also launched a public consultation and dialogue with stakeholders to better understand public concerns about the use of data.6  In 2016, the European Union adopted the General Data Protection Regulation (GDPR), which replaces the Data Protection Directive. The GDPR takes effect on May 25, 2018 and provides rules on the use of data that can be attributed to a person or persons.7 In October 2017, the European Commission proposed a new regulation “concerning the respect for private life and the protection of personal data in electronic communications” to replace the outdated e-Privacy Directive (European Commission 2017b).

The GDPR has important ramifications for companies that use AI. First, the regulation applies to all companies that are holding or processing data from EU citizens whether or not they are domiciled in the European Union. Second, it gives citizens the ability to challenge the use of algorithms in two ways. Article 21 allows anyone the right to opt out of ads tailored by algorithms. Article 22 of the GDPR allows citizens to contest legal or similarly significant decisions made by algorithms and to appeal for human intervention. Third, it uses disincentives to secure compliance. Companies that are found to violate the regulation will be “subject to a penalty up to 20 million euro or 4% of their global revenue, whichever is higher” (Wu 2017).

Analysts are speculating regarding the costs and benefits of this mixed approach of incentives to AI coupled with strong rules on data protection. Some analysts believe that firms may struggle to inform netizens as to why they used specific data sets, or to explain how a particular algorithm yielded x result (Jánošík 2017). Others contend that the regulation may not be as onerous as it seems; in fact, the regulation really states that people need to be informed on the use of algorithms, rather than specifically requiring that the use be clearly explained to the average citizen (Wachter, Mittelstadt and Floridi 2017). Still others find this strategy will have multiple negative spillovers: raising the cost of AI, reducing AI accuracy, damaging AI systems, constraining AI innovation and increasing regulatory risk. Nick Wallace and Daniel Castro (2018) noted that most firms do not understand the regulation or their responsibilities. In short, the regulation designed to build AI could undermine the European Union’s ability to use and innovate with AI.

Implications for Smaller and Developing Countries, including Canada

Countries are just beginning to figure out how best to use and to protect various types of data that could be used in AI, whether proprietary, personal, public or metadata. Most countries, especially developing countries, do not have significant expertise in AI. These states may be suppliers of personal data, but they do not control or process data. But policy makers and citizens, like those in industrialized countries, can take several steps to control data and extract rents from their personal data (Porter 2018).

These states may decide to shape their own markets by developing rules that require companies to pay them for data (Lanier 2013). Developing countries with large populations are likely to have the most leverage to adopt regulations that require firms to pay rents for their citizens’ data. In so doing, they may be able to influence comparative advantage in the data-driven economy.

Meanwhile, Canada will need to better integrate its trade and AI strategies. Canada has comparative advantage in AI, but its companies and researchers will need larger amounts of data than its 38 million people can provide (Aaronson 2017). Canada will need to use trade agreements to foster the data pools that underpin AI, while reassuring citizens that their personal data (whether anonymized or not) is protected. NAFTA renegotiations — assuming they are not undermined by US President Donald Trump — provide an opportunity to begin a different discussion in North America on AI. Canada’s AI sector is closely integrated with that of the United States; both nations need to encourage the data flows that power AI while simultaneously protecting citizens from misuse or unethical use of algorithms. A forthcoming CIGI paper will discuss how Canada might create a new approach to data-driven trade, regulating data not just by the type of service, but instead by the variant of data.

 

1 See, for example, Mozur (2017) and LeVine (2018).
2 See Ng (2016).
3 See http://dfat.gov.au/trade/agreements/not-yet-in-force/tpp/summaries/Documents/electronic-commerce.PDF.
4 See https://ec.europa.eu/digital-single-market/en/policies/shaping-digital-single-market.
5 See https://ec.europa.eu/digital-single-market/en/policies/building-european-data-economy and European Commission (2017a).
6 See https://ec.europa.eu/digital-single-market/en/news/synopsis-report-public-consultation-building-european-data-economy.
7 EC, Regulation (EU) 2016/679 of the European Parliament and of the Council of 27 April 2016 on the protection of natural persons with regard to the processing of personal data and on the free movement of such data, and repealing Directive 95/46/EC (General Data Protection Regulation) and Directive (EU) 2016/680 of the European Parliament and of the Council of 27 April 2016 on the protection of natural persons with regard to the processing of personal data by competent authorities for the purposes of the prevention, investigation, detection or prosecution of criminal offences or the execution of criminal penalties, and on the free movement of such data, and repealing Council Framework Decision 2008/977/JHA, [2016] OJ, L119. online: <http://eur-lex.europa.eu/legal-content/EN/TXT/PDF/?uri=OJ:L:2016:119:FULL&from=EN>.

Works Cited

Aaronson, Susan Ariel. 2017. Information Please: A Comprehensive Approach to Digital Trade Provisions in NAFTA 2.0. CIGI Paper No. 154. Waterloo, ON: CIGI. www.cigionline.org/sites/default/files/documents/Paper%20no.154web.pdf.

Aaronson, Susan Ariel and Patrick Le Blond. Forthcoming 2018. “Another Digital Divide: The Rise of Data Realms and Its Implications for the WTO.” Journal of International Economic Law.

BSA. 2017. "Hearing on 21st Century Trade Barriers: Protectionist Cross Border Data Flow Policies’ Impact on U.S. Jobs." Testimony of Victoria Espinel, President and CEO, BSA and The Software Alliance, October 12. http://docs.house.gov/meetings/IF/IF17/20171012/106381/HHRG-115-IF17-Wstate-EspinelV-20171012.pdf.

CIGI-Ipsos. 2017. “2017 CIGI-Ipsos Global Survey on Internet Security and Trust.” www.cigionline.org/internet-survey.

European Commission. 2017a. “Commission Staff Working Document on the free flow of data and emerging issues of the European data economy.”

———. 2017b. “Proposal for a Regulation of the European Parliament and of the Council concerning the respect for private life and the protection of personal data in electronic communications and repealing Directive 2002/58/EC (Regulation on Privacy and Electronic Communications).” http://eur-lex.europa.eu/legal-content/EN/TXT/PDF/?uri=CELEX:52017PC0010&from=EN.

Galeon, Dom and Kristin Houser. 2016. “IBM’s Watson AI Recommends Same Treatment as Doctors in 99% of Cancer Cases.” Futurism, October 28. https://futurism.com/ibms-watson-ai-recommends-same-treatment-as-doctors-in-99-of-cancer-cases/.

Goldsmith, Jack and Tim Wu. 2006. Who Controls the Internet? Illusions of a Borderless World. New York, NY: Oxford University Press.

Hern, Alex. 2017. “How social media filter bubbles and algorithms influence the election.” The Guardian, May 22. www.theguardian.com/technology/2017/may/22/social-media-election-facebook-filter-bubbles.

Jánošík, Juraj. 2017. “Transparency of machine-learning algorithms is a double-edged sword.” November 13. www.welivesecurity.com/2017/11/13/transparency-machine-learning-algorithms/.

Knight, Will. 2017. “Why Artificial Intelligence Should Be More Canadian.” The Download, MIT Technology Review, October 26. www.technologyreview.com/the-download/609239/why-artificial-intelligence-should-be-more-canadian/.

Lanier, Jaron. 2013. Who Owns the Future? New York, NY: Simon and Schuster.

LeVine, Steve. 2018. “Chinese AI isn’t beating the U.S. yet — and may never catch up.” Axios, March 14. www.axios.com/chinese-ai-isnt-beating-the-us-yet-0cf27b7d-fe89-48e6-a5da-a7a5a3a1b84d.html.

Mozur, Paul. 2017. “Beijing Wants A.I. to Be Made in China by 2030.” The New York Times, July 20. www.nytimes.com/2017/07/20/business/china-artificial-intelligence.html.

Ng, Alfred. 2016. “IBM’s Watson gives proper diagnosis for Japanese leukemia patient after doctors were stumped for months.” New York Daily News, August 7. www.nydailynews.com/news/world/ibm-watson-proper-diagnosis-doctors-stumped-article-1.2741857.

Office of the United States Trade Representative. 2017. “Summary of the Objectives for the NAFTA Renegotiation.” November. https://ustr.gov/sites/default/files/files/Press/Releases/Nov%20Objectives%20Update.pdf.

Porter, Eduardo. 2018. “Your Data Is Crucial to a Robotic Age. Shouldn’t You Be Paid for It?” The New York Times, March 17, www.nytimes.com/2018/03/06/business/economy/user-data-pay.html.

Reinhart, R. J. 2018. “Most Americans Already Using Artificial Intelligence Products.” Gallup, March 6. http://news.gallup.com/poll/228497/americans-already-using-artificial-intelligence-products.aspx.

RT.com. 2017. “’Whoever leads in AI will rule the world’: Putin to Russian children on Knowledge Day.” RT.com, September 1. www.rt.com/news/401731-ai-rule-world-putin/.

Wachter, Sandra, Brent Mittelstadt and Luciano Floridi. 2017. “Why a Right to Explanation of Automated Decision-Making Does Not Exist in the General Data Protection Regulation.” International Data Privacy Law 7 (2): 76–99. https://academic.oup.com/idpl/article/3860948.

Wallace, Nick and Daniel Castro. 2018. “The Impact of the EU’s New Data Protection Regulation on AI.” Center for Data Innovation. www2.datainnovation.org/2018-impact-gdpr-ai.pdf.

Wu, Pomin. 2017. “GDPR and its impacts on machine learning applications.” Medium, November 6. https://medium.com/trustableai/gdpr-and-its-impacts-on-machine-learning-applications-d5b5b0c3a815.

The opinions expressed in this article/multimedia are those of the author(s) and do not necessarily reflect the views of CIGI or its Board of Directors.

About the Author

Susan Ariel Aaronson is a CIGI senior fellow, research professor of international affairs at George Washington University and co-principal investigator with the National Science Foundation/National Institute of Standards and Technology, where she leads research on data and AI governance.

Rationale of a Data Strategy

The Role of a Data Strategy for Canadian Industries

Balancing Privacy and Commercial Values

Domestic Policy for Data Governance

International Policy Considerations

Epilogue