Dall-E and toppng.com
Just as AI systems seem to be creating a world of their own through various 'hallucinations', Google has announced a competition, running between now and mid-September, to help develop ways for AI systems to unlearn by removing "the influence of a specific subset of training examples", the "forget set", from a trained model. This is key to allowing individuals to exercise their rights under EU and UK privacy regulation, for example the rights 'to be forgotten', to object to or restrict processing, and to have errors rectified. Google accepts that in some cases it's possible to infer that an individual's personal data was used to train an AI model even after that data has been deleted. But what does machine unlearning mean for the 'holy grail' of artificial general intelligence?
Unlearning is intended to be a cost-effective alternative to completely retraining the AI model from scratch with the "forget set" removed from the training dataset. The idea is to remove certain data and its 'influence' while retaining the model's accuracy, its fairness and its ability to generalize to examples that were held out from training.
A problem with approaches to 'machine unlearning' to date has been inconsistency in the measures for evaluating their effectiveness, making comparisons impracticable.
By standardizing the evaluation metrics, Google hopes to identify the strengths and weaknesses of different unlearning algorithms and spark broader work on this aspect of AI.
As part of the challenge, Google will provide a set of information, some of which (the "forget set") must be forgotten. If unlearning is successful, the unlearned model should contain no traces of the forgotten examples, so that 'membership inference attacks' (MIAs) would be unable to infer that any of them was part of the original training dataset.
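To make the idea of a membership inference attack a little more concrete, here is a minimal sketch of one of the simplest variants: a loss-threshold attack. It assumes we already have the model's per-example loss on the forget set and on a held-out set the model never saw; the function name and the toy numbers are hypothetical, and this is not the evaluation metric Google's challenge uses. The attacker guesses "was in the training data" whenever the loss is low. If the best such guess is right only about half the time, the forgotten examples look no different from unseen ones.

```python
from typing import Sequence


def best_threshold_attack_accuracy(
    forget_losses: Sequence[float],
    holdout_losses: Sequence[float],
) -> float:
    """Balanced accuracy of the best loss-threshold membership attack.

    The attack guesses "member" when an example's loss is <= a threshold.
    A result near 0.5 means the attacker does no better than a coin flip;
    a result near 1.0 means the forget set is still clearly recognizable.
    The threshold is tuned on the same data it is scored on, so this is
    an optimistic (attacker-friendly) estimate.
    """
    if len(forget_losses) == 0 or len(holdout_losses) == 0:
        raise ValueError("both loss lists must be non-empty")

    # Every observed loss value is a candidate threshold.
    candidates = sorted(set(forget_losses) | set(holdout_losses))

    best = 0.5  # a trivial attacker can always achieve chance level
    for threshold in candidates:
        # Forget-set examples correctly flagged as members.
        true_positive_rate = sum(
            loss <= threshold for loss in forget_losses
        ) / len(forget_losses)
        # Held-out examples correctly flagged as non-members.
        true_negative_rate = sum(
            loss > threshold for loss in holdout_losses
        ) / len(holdout_losses)
        best = max(best, (true_positive_rate + true_negative_rate) / 2)
    return best


# Toy numbers (made up for illustration only).
leaky = best_threshold_attack_accuracy([0.1, 0.2, 0.15], [1.0, 1.2, 0.9])
clean = best_threshold_attack_accuracy([0.5, 1.0], [0.5, 1.0])
```

In the first toy case the forget-set losses are all far lower than the held-out losses, so the attack separates them perfectly; in the second the two sets are indistinguishable and the attack stays at chance. Real attacks are considerably more sophisticated, but the principle is the same.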
Hallucination or fabrication is a problem from which humans also suffer. 'Machine unlearning' is perhaps different, and its advent provides another reason why 'artificial general intelligence' (a computer's ability to replicate human intelligence) will remain elusive. Humans do not forget cleanly or on demand: we often forget things only to recall them later, or are unable to recall events, or aspects of them, that we witnessed firsthand or were 'supposed' to remember (like an accident, a birthday or a wedding anniversary).