{"id":235,"date":"2016-06-07T14:27:11","date_gmt":"2016-06-07T14:27:11","guid":{"rendered":"https:\/\/blogs.mathworks.com\/headlines\/?p=235"},"modified":"2017-07-05T12:19:17","modified_gmt":"2017-07-05T12:19:17","slug":"your-car-doesnt-have-to-be-smart-to-be-a-privacy-concern","status":"publish","type":"post","link":"https:\/\/blogs.mathworks.com\/headlines\/2016\/06\/07\/your-car-doesnt-have-to-be-smart-to-be-a-privacy-concern\/","title":{"rendered":"Your car doesn\u2019t have to be \u201csmart\u201d to be a privacy concern"},"content":{"rendered":"<p>Your <a href=\"http:\/\/www.dataprivacymonitor.com\/online-privacy\/legal-developments-in-connected-car-arena-provide-glimpse-of-privacy-and-data-security-regulation-in-internet-of-things\/\" target=\"_blank\">smart car can tattle <\/a>on you. It not only knows where you you have been, but also how fast you were driving and how hard you brake. It\u00a0can tell if you were the driver or if someone else was behind the wheel. Due to the connected nature of smart cars, it\u00a0can share this data over web-based platforms.<\/p>\n<p>Over the last few decades, automobiles evolved from modes of transport into sensor-laden mobile computing platforms. While the sensor-generated data has enabled breakthroughs in safety features and performance, it also creates privacy concerns for drivers.<\/p>\n<p>But your car doesn\u2019t have to be smart to be\u00a0a privacy concern. Cars that aren&#8217;t connected can also create\u00a0trackable driver passports.<\/p>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"alignnone wp-image-238 size-full\" src=\"https:\/\/blogs.mathworks.com\/headlines\/files\/feature_image\/speedometer-data.jpg\" alt=\" car speedometer generates data\" width=\"306\" height=\"204\" \/><\/p>\n<h2><span style=\"color: #e8941a;\"><strong>Even a\u00a020-year-old car can share\u00a0your driving data<\/strong><\/span><\/h2>\n<p>Event data recorders\u00a0(EDRs) have been built into cars for years. The data from these in-vehicle black boxes and other car sensor data is accessible to mechanics via on-board diagnostic systems (OBDs), which have been required in cars sold in the United States since 1996 and in the EU since 2001. For cars that aren\u2019t considered &#8216;smart&#8217;, the\u00a0data is not wirelessly transmitted and is only accessible from inside the car itself.<\/p>\n<p>In recent years, these devices have allowed\u00a0police to reconstruct crash scenes with actual speed and braking data. Insurance companies have utilized data from EDRs to assign accident blame. According to the Insurance Institute for Highway Safety, <a title=\"http:\/\/www.iihs.org\/iihs\/topics\/t\/event-data-recorders\/qanda (link no longer works)\" target=\"_blank\">\u201cinsurers may be able to access the EDRs in their policyholders&#8217; vehicles based on provisions in the insurance contract.\u201d<\/a><\/p>\n<h2><span style=\"color: #e8941a;\"><strong>According to police, nothing that would identify driver<\/strong><\/span><\/h2>\n<p>Police have acknowledged that some people are concerned about privacy when it comes to EDRs, and that many worry\u00a0that the law enforcement agencies and insurance providers can use the data to spy on drivers.\u00a0They have maintained that there is nothing in the data\u00a0that would identify what&#8217;s going on in the cab with the person or <a href=\"http:\/\/www.cbc.ca\/news\/canada\/nova-scotia\/event-data-recorders-police-insurance-cars-air-bag-control-modules-1.3585539\" target=\"_blank\">what person is sitting in the driver&#8217;s seat<\/a>.<\/p>\n<p><strong>That last statement is no longer true.<\/strong> In a study published last month (see below), researchers from the University of Washington and the University of California, San Diego showed that the data from your car can be used to positively identify the driver with 100% accuracy after just minutes of driving.<\/p>\n<h2><span style=\"color: #e8941a;\"><strong>Machine learning and the sensors in your car can ID you in minutes<\/strong><\/span><\/h2>\n<p><em>A\u00a0WIRED\u00a0<\/em>report, <em><a href=\"https:\/\/www.wired.com\/2016\/05\/drive-car-can-id-within-minutes-study-finds\/\" target=\"_blank\">A Car\u2019s Computer Can \u2018Fingerprint\u2019 You in Minutes Based on How You Drive<\/a><\/em>, explained that with just a few minutes\u2019 worth of the information your car is already recording, the driver of a car can be accurately identified. In the <a href=\"http:\/\/www.autosec.org\/pubs\/fingerprint.pdf\" target=\"_blank\">study<\/a>, which the researchers plan to present at <em>the Privacy Enhancing Technology Symposium<\/em> this summer, the team used <a href=\"https:\/\/www.mathworks.com\/products\/matlab\/\" target=\"_blank\">MATLAB<\/a>\u00a0and machine learning\u00a0to accurately identify the drivers with data collected on the car\u2019s internal computer.<\/p>\n<p style=\"padding-left: 30px;\">\u201cWith very limited amounts of driving data we can enable very powerful and accurate inferences about the driver\u2019s identity,\u201d says Miro Enev, one of the authors of the study.<\/p>\n<p>To ensure the focus of the project was on differences in driving habits, the same car was used by all 15 test participants. The participants all drove the same route at the same time of day, and were even required to listen to the same radio station.<\/p>\n<p>The researchers collected data from the car\u2019s internal computer and used a machine learning algorithm to analyze the data for each\u00a0driver. The machine learning algorithm used\u00a090 percent of the driving data to train\u00a0the algorithm. The remaining 10 percent was used to test the algorithm.<\/p>\n<p>The trained machine learning algorithm was able to distinguish the drivers in the study with 100% accuracy with only 15 minutes of the driving data.<\/p>\n<h2><span style=\"color: #e8941a;\"><strong>Comparing machine learning\u00a0algorithms with MATLAB<\/strong><\/span><\/h2>\n<p>The researchers used various MATLAB toolboxes for the processing pipeline, from sensor signal processing to comparing performance of various machine learning algorithms. First, signal preprocessing was performed using the <a href=\"https:\/\/www.mathworks.com\/products\/signal\/\" target=\"_blank\">Signal Processing Toolbox<\/a> and\u00a0<a href=\"https:\/\/www.mathworks.com\/products\/wavelet\/?s_tid=srchtitle\" target=\"_blank\">Wavelet\u00a0Toolbox<\/a>:<\/p>\n<p style=\"padding-left: 30px;\">After the data is uniformly sampled, each sensor stream was\u00a0smoothed by applying wavelet denoising to remove high frequency artifacts. This operation involved multi-level stationary wavelet decomposition and subsequent reconstruction using the Haar wavelet with the default denoising threshold of the\u00a0MATLAB\u00a0<a href=\"https:\/\/www.mathworks.com\/help\/wavelet\/ref\/iswt.html\" target=\"_blank\">iswt<\/a>\u00a0command.<\/p>\n<p>The researchers then performed feature extraction and classification using methods available in the <a href=\"https:\/\/www.mathworks.com\/products\/statistics\/\" target=\"_blank\">Statistics and Machine Learning Toolbox<\/a>. After preprocessing the sensor signals, features that capture the statistical and morphological characteristics of the signals are derived. In total, 48 features are derived for each sensor and time segment.<\/p>\n<p>Random Forest, Support Vector Machine (SVM), <a href=\"https:\/\/www.mathworks.com\/help\/stats\/classificationknn-class.html\" target=\"_blank\">K-Nearest Neighbor<\/a> (KNN) and <a title=\"https:\/\/www.mathworks.com\/help\/stats\/naivebayes-class.html (link no longer works)\" target=\"_blank\">Naive Bayes<\/a> were the four classification algorithms that were trained and evaluated in this work. Of the four classifiers, Random Forest\u00a0yielded the best results.<\/p>\n<h2><span style=\"color: #e8941a;\"><strong>The privacy implications<\/strong><\/span><\/h2>\n<p>According to the article in <em>WIRED<\/em>, the ability to use this data could have unexpected privacy implications: everything from letting insurance companies punish drivers who loan their cars to their teenage kids, to identifying a driver who violated traffic laws or caused a collision.<\/p>\n<p style=\"padding-left: 30px;\">\u201cThe same data that tells their insurance company when they\u2019ve let their\u00a016-year-old kid take their car to prom might just as easily be used to identify drunk driving or a medical condition that\u2019s altered someone\u2019s driving ability, tests Enev claims\u00a0would actually be simpler\u00a0than trying to distinguish a driver\u2019s identity,\u201d says Andy Greenberg,\u00a0senior writer for WIRED.<\/p>\n<p>On March 15, the U.S. Senate Committee on Commerce, Science and Transportation met with leaders in the autonomous vehicle industry to discuss cybersecurity and privacy.\u00a0According to <em><a title=\"http:\/\/www.govtech.com\/fs\/insights\/Computerization-of-Cars-Gives-Rise-to-Concerns-About-Hacking-Privacy.html (link no longer works)\" target=\"_blank\">Government Technology Magazine<\/a><\/em>, \u201cThe questions that arose during the committee hearing were to what extent automakers could use that data. Could they collect it and sell it to third parties for marketing purposes? <strong>Could they tie information to specific people?<\/strong>\u201d\u00a0We now know the answer to that last question is a resounding yes.<\/p>\n","protected":false},"excerpt":{"rendered":"<div class=\"overview-image\"><img decoding=\"async\"  class=\"img-responsive\" src=\"https:\/\/blogs.mathworks.com\/headlines\/files\/feature_image\/speedometer-data.jpg\" onError=\"this.style.display ='none';\" \/><\/div>\n<p>Your smart car can tattle on you. It not only knows where you you have been, but also how fast you were driving and how hard you brake. It\u00a0can tell if you were the driver or if someone else was&#8230; <a class=\"read-more\" href=\"https:\/\/blogs.mathworks.com\/headlines\/2016\/06\/07\/your-car-doesnt-have-to-be-smart-to-be-a-privacy-concern\/\">read more >><\/a><\/p>\n","protected":false},"author":138,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":[],"categories":[1],"tags":[],"_links":{"self":[{"href":"https:\/\/blogs.mathworks.com\/headlines\/wp-json\/wp\/v2\/posts\/235"}],"collection":[{"href":"https:\/\/blogs.mathworks.com\/headlines\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blogs.mathworks.com\/headlines\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blogs.mathworks.com\/headlines\/wp-json\/wp\/v2\/users\/138"}],"replies":[{"embeddable":true,"href":"https:\/\/blogs.mathworks.com\/headlines\/wp-json\/wp\/v2\/comments?post=235"}],"version-history":[{"count":9,"href":"https:\/\/blogs.mathworks.com\/headlines\/wp-json\/wp\/v2\/posts\/235\/revisions"}],"predecessor-version":[{"id":920,"href":"https:\/\/blogs.mathworks.com\/headlines\/wp-json\/wp\/v2\/posts\/235\/revisions\/920"}],"wp:attachment":[{"href":"https:\/\/blogs.mathworks.com\/headlines\/wp-json\/wp\/v2\/media?parent=235"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blogs.mathworks.com\/headlines\/wp-json\/wp\/v2\/categories?post=235"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blogs.mathworks.com\/headlines\/wp-json\/wp\/v2\/tags?post=235"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}