Boffins at Microsoft Research have devised a way to automatically check code for compliance with privacy laws – and, according to Redmond, it's so simple that even non-techies can use it successfully.
Microsoft Research (MSR) has developed a programming language called Legalease, to be used together with its data inventory engine, Grok, which it claims can comb millions of lines of constantly changing code and check that it complies with rules on privacy.
Legalease is intended for use by non-programming types to specify restrictions on how data is handled. One of the main drivers behind Legalease's development was that software developers and those setting companies’ privacy policies don’t share a common language.
MSR says that more than 20 per cent of the code in Bing changes on a daily basis, with changes made by thousands of programmers. Changes in code might affect how data is used or who views it, potentially violating company, government or regulatory privacy policies.
Keeping tabs on changes in very large systems, like the Bing search engine, using manual audits is difficult. According to MSR automated testing is the best way to verify compliance with privacy rules and laws on the massive scale demanded in environments like Bing.
Grok, meanwhile, annotates existing code using a system that cross-references information from different sources, based on varying levels of confidence.
According to Microsoft, pattern-matching to column names across a database results in a low-confidence score, while annotations made manually by developers are deemed to be more trustworthy and thus get a high-confidence score.
MSR says it developed Grok for use on Bing but found writing suitable polices hard – and this was what led to Legalese. Both were tested on Bing and are now running on the data analytics pipeline of Microsoft's search engine.
MSR presented Legalese and Grok at the 35th IEEE Symposium on Security and Privacy in San Jose, California, this week. Redmond claims a group of non-coders took less than five minutes to learn how to use Legalease and just 15 minutes to code nine Bing policy clauses on the use of user information.
Saikat Guha, a researcher at Microsoft, in a statement called Legalease “the final piece of the automated privacy compliance jigsaw puzzle."
You can read more on the Microsoft Research site here. ®