Please note: The algorithm descriptions in English have been automatically translated. Errors may have been introduced in this process. For the original descriptions, go to the Dutch version of the Algorithm Register.

Octobox Anonymisation

Recognise and anonymise privacy-sensitive information in documents.

Last change on 3rd of July 2024, at 13:49 (CET) | Publication Standard 1.0
Publication category
Other algorithms
Impact assessment
Field not filled in.
Status
In use

General information

Theme

Organisation and business operations

Begin date

2023-09

Contact information

https://www.ozhz.nl/contact 

Link to publication website

https://www.ozhz.nl

Responsible use

Goal and impact

The anonymisation tool supports transparency while providing the necessary protection for the individuals and companies to whom documents relate. The impact of the algorithm on citizens and companies is low. The algorithm searches specifically for (personal) data and masks it regardless of the rest of a document's content. OZHZ currently uses the tool mainly to anonymise soil reports. Later, the tool will become part of the document management system (DMS) and will be applied more broadly for the same purpose.

Considerations

Using the anonymisation tool speeds up and simplifies the process for passive and, at present, especially active disclosure of soil reports. Automated anonymisation is also less error-prone than human intervention. This reduces the risk of a data leak and better protects citizens' and companies' data.

Human intervention

An employee approves or rejects the anonymisations proposed by the tool in all cases for soil reports that are to be actively disclosed, for decisions that are to be formally published (in due course), and for WOO requests. Random checks are performed on soil reports that are to be anonymised.

Risk management

Extensive testing took place before the tool went live, and new versions are tested before they go live. To mitigate the risk that documents are insufficiently anonymised, human verification takes place: depending on the type of document, this is either a random check or a full check.
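
The split between full and random checks can be illustrated with a short Python sketch. It is not the procedure OZHZ or Octobox actually uses: the document type names, the `FULL_CHECK_TYPES` set, the `select_for_review` function and the sample rate are all hypothetical and serve only to show the idea.

```python
import random

# Hypothetical document types that always receive a full human check.
# The actual classification used by OZHZ is not published.
FULL_CHECK_TYPES = {"woo_request", "formal_decision"}


def select_for_review(documents, sample_rate=0.1, seed=None):
    """Return the ids of documents an employee should verify.

    Documents of a full-check type are always selected; all other
    documents are selected at random with probability `sample_rate`.
    """
    rng = random.Random(seed)
    selected = []
    for doc in documents:
        if doc["type"] in FULL_CHECK_TYPES or rng.random() < sample_rate:
            selected.append(doc["id"])
    return selected


documents = [
    {"id": 1, "type": "woo_request"},
    {"id": 2, "type": "soil_report"},
    {"id": 3, "type": "formal_decision"},
    {"id": 4, "type": "soil_report"},
]
print(select_for_review(documents, sample_rate=0.5, seed=42))
```

Whatever the sample rate, documents 1 and 3 are always selected in this sketch; the soil reports are selected only when the random draw falls below the rate.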

Legal basis

General Data Protection Regulation (AVG)

Environment Act

General Administrative Law Act (AWB)

Disclosure Act

Open Government Act (WOO)

Electronic Publications Act (WEP)

Links to legal bases

  • Algemene verordening gegevensbescherming (AVG): https://wetten.overheid.nl/BWBR0040940
  • Omgevingswet: https://wetten.overheid.nl/BWBR0037885
  • Algemene Wet Bestuursrecht (AWB): https://wetten.overheid.nl/BWBR0005537
  • Bekendmakingswet: https://wetten.overheid.nl/BWBR0004287
  • Wet Open Overheid (WOO): https://wetten.overheid.nl/BWBR0045754
  • Wet Elektronische Publicaties (WEP): https://wetten.overheid.nl/BWBR0043961

Operations

Data

Value lists are used. They specify which data must not be visible and which data must remain visible.
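
How value lists of this kind could drive masking is sketched below in Python. This is an illustration under assumptions, not the Octobox implementation: the function names, the example values and the `[MASKED]` placeholder are hypothetical, and plain string matching stands in for whatever recognition the tool really performs.

```python
import re


def find_spans(text, values):
    """Return (start, end) positions of every occurrence of each value."""
    spans = []
    for value in values:
        for match in re.finditer(re.escape(value), text, flags=re.IGNORECASE):
            spans.append((match.start(), match.end()))
    return spans


def merge_spans(spans):
    """Merge overlapping or touching spans so each region is masked once."""
    merged = []
    for start, end in sorted(spans):
        if merged and start <= merged[-1][1]:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def mask_text(text, mask_values, keep_values, placeholder="[MASKED]"):
    """Mask values from the mask list unless they fall inside a keep-list value."""
    keep_spans = find_spans(text, keep_values)
    mask_spans = [
        (s, e)
        for s, e in find_spans(text, mask_values)
        if not any(s < ke and ks < e for ks, ke in keep_spans)
    ]
    # Replace from the end of the text so earlier offsets stay valid.
    for start, end in sorted(merge_spans(mask_spans), reverse=True):
        text = text[:start] + placeholder + text[end:]
    return text


# Hypothetical example values; the real lists are not published.
text = "Report for Jan Jansen, Dorpsstraat 12, prepared by Jansen Advies B.V."
print(mask_text(
    text,
    mask_values=["Jan Jansen", "Jansen", "Dorpsstraat 12"],
    keep_values=["Jansen Advies B.V."],
))
```

In this sketch the keep list takes precedence: the name "Jansen" is masked where it refers to a person, but stays visible inside the company name that appears on the keep list.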

Technical design

The anonymisation tool works with Natural Language Processing (NLP). NLP is a branch of Artificial Intelligence (AI) that deals with language. For example, texts can be classified by recognising what the subject of a sentence is, or which word is a verb or a name. Through AI, the software recognises more and more terms over time.

External provider

Octobox Netherlands B.V.

Similar algorithm descriptions

  • Recognising and anonymising privacy-sensitive information in documents

    Last change on 30th of May 2024, at 14:12 (CET) | Publication Standard 1.0
    Publication category
    Other algorithms
    Impact assessment
    Field not filled in.
    Status
    In use
  • Recognising and anonymising privacy-sensitive information in documents

    Last change on 12th of June 2024, at 6:53 (CET) | Publication Standard 1.0
    Publication category
    Other algorithms
    Impact assessment
    Field not filled in.
    Status
    In use
  • Recognise and anonymise privacy-sensitive information and documents.

    Last change on 14th of October 2024, at 13:17 (CET) | Publication Standard 1.0
    Publication category
    Other algorithms
    Impact assessment
    DPIA
    Status
    In use
  • Recognising and anonymising privacy-sensitive information in documents and other information sources.

    Last change on 23rd of October 2024, at 13:58 (CET) | Publication Standard 1.0
    Publication category
    Other algorithms
    Impact assessment
    Field not filled in.
    Status
    In use
  • Octobox Anonymisation is used to make information, such as personal data, unreadable (redaction). This is mainly done for requests under the Open Government Act (Woo). The basis of the algorithm is formed by AVG rules. In addition, the algorithm has self-learning properties based on human input.

    Last change on 11th of September 2024, at 15:14 (CET) | Publication Standard 1.0
    Publication category
    Other algorithms
    Impact assessment
    DPIA, ...
    Status
    In use