Please note: The algorithm descriptions in English have been automatically translated. Errors may have been introduced in this process. For the original descriptions, go to the Dutch version of the Algorithm Register.

Anonymisation software

Among other things, the algorithm recognises and anonymises (personal) data and confidential (financial) data in documents before they are published, e.g. on the basis of the Open Government Act.

Last change on 10th of April 2024, at 14:17 (CET) | Publication Standard 1.0
Publication category
Other algorithms
Impact assessment
Field not filled in.
Status
In use

General information

Theme

Organisation and business operations

Begin date

04-2023

Contact information

info@hhnk.nl

Link to publication website

www.hhnk.nl

Responsible use

Goal and impact

The anonymisation software is used to balance transparency with the necessary protection of the individuals and companies to whom the documents relate.

Considerations

Hoogheemraadschap Hollands Noorderkwartier is subject to various laws and regulations under which it discloses information, either proactively or on request. This information may contain privacy-sensitive data, so it is important that it is anonymised before disclosure. Anonymising data by hand is time-consuming and prone to errors and data leaks. Anonymisation software enables users to anonymise personal and confidential information themselves efficiently.

Human intervention

The application supports employees in redacting data quickly and carefully. Documents anonymised with the software are always checked by an employee afterwards and adjusted if necessary. The algorithm itself is continuously retrained.

Risk management

The risk is minimal because the software does not make decisions: it only proposes which data and information to anonymise. A Water Board employee always performs the final check that a document is correctly anonymised.
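The division of roles described here (the software proposes, the employee decides and performs the final check) can be sketched as a simple review gate. This is a minimal illustration under stated assumptions, not the supplier's implementation: the names `Proposal`, `Decision`, `ReviewedDocument`, `sign_off` and `to_mask` are hypothetical and do not come from the register entry or from DataMask.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List, Optional


class Decision(Enum):
    PENDING = "pending"    # proposed by the software, not yet reviewed
    ACCEPTED = "accepted"  # employee agrees: mask this item
    REJECTED = "rejected"  # employee disagrees: leave this item visible


@dataclass
class Proposal:
    """One item the software suggests masking (hypothetical structure)."""
    text: str
    decision: Decision = Decision.PENDING


@dataclass
class ReviewedDocument:
    proposals: List[Proposal] = field(default_factory=list)
    approved_by: Optional[str] = None

    def sign_off(self, employee: str) -> None:
        """Record the employee's final check; refuse if anything is unreviewed."""
        if any(p.decision is Decision.PENDING for p in self.proposals):
            raise ValueError("every proposal needs a decision before sign-off")
        self.approved_by = employee

    @property
    def ready_to_publish(self) -> bool:
        # The software alone can never set this; only a sign-off can.
        return self.approved_by is not None

    def to_mask(self) -> List[str]:
        """Items that will actually be masked: only those the employee accepted."""
        return [p.text for p in self.proposals if p.decision is Decision.ACCEPTED]
```

In this sketch a document cannot reach the publishable state through the software alone, which mirrors the statement that the software makes no decisions.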

Legal basis

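Open Government Act (Wet open overheid, Woo) and General Data Protection Regulation (Algemene Verordening Gegevensbescherming, AVG)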

Links to legal bases

  • AVG: https://eur-lex.europa.eu/legal-content/NL/TXT/HTML/?uri=CELEX:31995L0046
  • WOO: https://wetten.overheid.nl/BWBR0045754/2023-04-01

Operations

Data

The data processed depend on the document being anonymised. Examples include personal data such as names and initials, e-mail addresses, phone numbers, financial data, bank account numbers, address details, and signatures. If anonymisation is done on the basis of the Open Government Act (Woo), it may also involve data other than personal data. The applicable grounds for exemption are listed in the Woo.

Technical design

Documents are fed into DataMask's algorithm. The software uses pattern recognition and Natural Language Processing to look for names, addresses, dates of birth, specific predefined words, signatures, and data matched by regular expressions (such as e-mail addresses, IBANs, and BSNs). The DataMask software recognises these items fully automatically and suggests masking or anonymising them. The employee chooses which data to mask. The outcome is a document in which the necessary data are irreversibly masked. The supplier does not use any Water Board data to train this algorithm.
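To make the regular-expression part of this description concrete, the following is a minimal sketch of how e-mail addresses, Dutch IBANs, and BSNs could be detected and offered as masking suggestions. It is an illustration only, not DataMask's actual rules or API: the patterns, the `Suggestion` class, and the functions `suggest` and `mask` are assumptions introduced for this example. The Natural Language Processing step for names and addresses is not covered, and real redaction must also remove the underlying text from the published file, which plain string replacement does not do.

```python
import re
from dataclasses import dataclass
from typing import Iterable, List

# Illustrative patterns (assumptions, deliberately simplified).
PATTERNS = {
    "email": re.compile(r"\b[\w.+-]+@[\w-]+(?:\.[\w-]+)+\b"),
    "iban": re.compile(r"\bNL\d{2}[A-Z]{4}\d{10}\b"),  # Dutch IBANs only
    "bsn": re.compile(r"\b\d{9}\b"),                   # candidate 9-digit numbers
}


@dataclass(frozen=True)
class Suggestion:
    kind: str
    start: int
    end: int
    text: str


def bsn_checksum_ok(digits: str) -> bool:
    """Dutch 'elfproef' for a BSN: weights 9..2 and -1, sum divisible by 11."""
    weights = [9, 8, 7, 6, 5, 4, 3, 2, -1]
    total = sum(w * int(d) for w, d in zip(weights, digits))
    return total % 11 == 0


def suggest(text: str) -> List[Suggestion]:
    """Return masking suggestions, ordered by position in the text."""
    found = []
    for kind, pattern in PATTERNS.items():
        for m in pattern.finditer(text):
            # Drop 9-digit numbers that fail the checksum to limit false hits.
            if kind == "bsn" and not bsn_checksum_ok(m.group()):
                continue
            found.append(Suggestion(kind, m.start(), m.end(), m.group()))
    return sorted(found, key=lambda s: s.start)


def mask(text: str, accepted: Iterable[Suggestion]) -> str:
    """Overwrite only the suggestions the employee accepted."""
    # Work right to left so earlier offsets stay valid.
    for s in sorted(accepted, key=lambda s: s.start, reverse=True):
        text = text[:s.start] + "X" * (s.end - s.start) + text[s.end:]
    return text


if __name__ == "__main__":
    sample = "Contact jan.jansen@example.nl, IBAN NL91ABNA0417164300."
    proposals = suggest(sample)
    # Here the employee accepts every suggestion; in practice this is a choice.
    print(mask(sample, proposals))
```

Keeping detection (`suggest`) separate from masking (`mask`) reflects the workflow in the text: the software only proposes, and masking is applied to the items the employee selects.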

External provider

Datamask B.V.

Link to code base

Persoonsgegevens in Documenten Anonimiseren met DataMask ("Anonymising Personal Data in Documents with DataMask")

Similar algorithm descriptions

  • Among other things, the algorithm recognises and anonymises (personal) data and confidential (financial) data in documents before they are published, e.g. on the basis of the Open Government Act.

    Last change on 8th of April 2024, at 17:15 (CET) | Publication Standard 1.0
    Publication category
    Other algorithms
    Impact assessment
    Field not filled in.
    Status
    In use
  • Among other things, the algorithm recognises and anonymises (personal) data and confidential (financial) data in documents before they are published, e.g. on the basis of the Open Government Act.

    Last change on 5th of September 2024, at 14:30 (CET) | Publication Standard 1.0
    Publication category
    Other algorithms
    Impact assessment
    DPIA
    Status
    In use
  • Among other things, the algorithm recognises and anonymises (personal) data and confidential (financial) data in documents before they are published, e.g. on the basis of the Open Government Act.

    Last change on 4th of April 2024, at 9:22 (CET) | Publication Standard 1.0
    Publication category
    Other algorithms
    Impact assessment
    Field not filled in.
    Status
    In use
  • Among other things, the algorithm recognises and anonymises (personal) data and confidential (financial) data in documents before they are published, e.g. on the basis of the Open Government Act.

    Last change on 8th of April 2024, at 17:05 (CET) | Publication Standard 1.0
    Publication category
    Other algorithms
    Impact assessment
    Field not filled in.
    Status
    In use
  • Among other things, the algorithm recognises and anonymises (personal) data and confidential (financial) data in documents before they are published, e.g. on the basis of the Open Government Act.

    Last change on 9th of April 2024, at 7:18 (CET) | Publication Standard 1.0
    Publication category
    Other algorithms
    Impact assessment
    Field not filled in.
    Status
    In use