Please note: The algorithm descriptions in English have been automatically translated. Errors may have been introduced in this process. For the original descriptions, go to the Dutch version of the Algorithm Register.

Datamask

The algorithm recognises and anonymises personal and confidential data in documents before they are published.

Last changed on 15 July 2024 at 7:15 (CET) | Publication Standard 1.0
Publication category
Other algorithms
Impact assessment
DPIA
Status
In use

General information

Theme

Organisation and business operations

Begin date

2023-12

Contact information

algoritmen@haarlem.nl

Responsible use

Goal and impact

The anonymisation software is used to put transparency into practice while providing the necessary protection for the individuals and companies to whom the documents relate.

Considerations

The municipality of Haarlem is subject to various laws and regulations that require information to be disclosed both proactively and on request. This information may contain privacy-sensitive data, which must be anonymised before disclosure. Anonymising data by hand is time-consuming and carries a risk of human error, which can lead to data breaches. Anonymisation software enables users to anonymise personal and confidential information themselves in an efficient manner.

Human intervention

Every document anonymised by the software is checked by an employee, who determines whether it has been anonymised correctly.

Risk management

DataMask creates a proposal for anonymising data and information. An employee of the municipality always performs the final check on whether a document has been anonymised correctly. As a result, the risk of incorrect anonymisation is considered minimal.
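The propose-then-review workflow described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not DataMask's implementation: the `Proposal` class, its fields and the `apply_approved` function are hypothetical names invented for this example. The point it shows is that the software only proposes spans, and nothing is masked until an employee has approved it.

```python
from dataclasses import dataclass


@dataclass
class Proposal:
    """One suggested redaction (hypothetical structure, for illustration only)."""
    start: int              # offset of the first character to mask
    end: int                # offset just past the last character to mask
    label: str              # e.g. "EMAIL"; label names are assumptions
    approved: bool = False  # set by the reviewing employee, not by the software


def apply_approved(text: str, proposals: list[Proposal]) -> str:
    """Mask only the spans that a human reviewer has approved."""
    result = text
    # Work from the end of the text so earlier offsets stay valid.
    for p in sorted(proposals, key=lambda p: p.start, reverse=True):
        if p.approved:
            result = result[:p.start] + f"[{p.label}]" + result[p.end:]
    return result


text = "Mail a@b.nl on 1 May"
proposals = [
    Proposal(5, 11, "EMAIL", approved=True),  # reviewer accepted this one
    Proposal(15, 20, "DATE"),                 # reviewer left this one untouched
]
masked = apply_approved(text, proposals)
```

In this sketch an unreviewed proposal defaults to not being applied, so a skipped review step leaves the text unmasked and visible to the reviewer rather than silently altered. This simple version assumes the proposed spans do not overlap.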

Legal basis

General Data Protection Regulation (GDPR, in Dutch: AVG), General Data Protection Regulation Implementation Act (UAVG) and Open Government Act (Woo)

Impact assessment

Data Protection Impact Assessment (DPIA)

Operations

Data

The data processed depends on the document being anonymised. Examples include personal data such as e-mail addresses, phone numbers, bank account numbers, address details and signatures.

Technical design

Smart features, such as predefined rules and templates, make it possible to anonymise documents individually or in bulk. They also allow the method and degree of anonymisation to be fixed in advance for commonly used (standard) documents. The software then uses pattern recognition and Natural Language Processing to search for names, addresses, dates of birth, specific configured words, signatures and text matching regular expressions (such as e-mail addresses, IBANs and BSNs, the Dutch citizen service numbers). DataMask recognises these items and automatically suggests masking or anonymising them.
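As a rough illustration of the regular-expression part of this design, the Python sketch below finds e-mail addresses, IBANs and BSNs and returns them as suggestions without masking anything itself. It is not DataMask's implementation: the patterns, the `Suggestion` class and all function names are assumptions made for this example, and the NLP-based detection of names, addresses and signatures is left out. The checksum tests (mod-97 for IBANs, the Dutch "elfproef" for BSNs) are an addition of this sketch, intended to reduce false positives.

```python
import re
from dataclasses import dataclass

# Illustrative patterns only; real rules would be broader (e.g. IBANs with spaces).
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
IBAN_RE = re.compile(r"\b[A-Z]{2}\d{2}[A-Z0-9]{11,30}\b")
BSN_RE = re.compile(r"\b\d{9}\b")


@dataclass
class Suggestion:
    kind: str   # "email", "iban" or "bsn"
    start: int  # offset of the match in the text
    end: int    # offset just past the match
    text: str   # the matched string


def iban_is_valid(iban: str) -> bool:
    """Mod-97 check: move the first four characters to the end, map A..Z to 10..35."""
    rearranged = iban[4:] + iban[:4]
    digits = "".join(str(int(ch, 36)) for ch in rearranged)
    return int(digits) % 97 == 1


def bsn_is_valid(bsn: str) -> bool:
    """Dutch 'elfproef' for nine digits: weights 9..2 and -1, sum divisible by 11."""
    weights = [9, 8, 7, 6, 5, 4, 3, 2, -1]
    return sum(w * int(d) for w, d in zip(weights, bsn)) % 11 == 0


def suggest_redactions(text: str) -> list[Suggestion]:
    """Return candidate spans for a human to review, ordered by position."""
    found = []
    for m in EMAIL_RE.finditer(text):
        found.append(Suggestion("email", m.start(), m.end(), m.group()))
    for m in IBAN_RE.finditer(text):
        if iban_is_valid(m.group()):
            found.append(Suggestion("iban", m.start(), m.end(), m.group()))
    for m in BSN_RE.finditer(text):
        if bsn_is_valid(m.group()):
            found.append(Suggestion("bsn", m.start(), m.end(), m.group()))
    return sorted(found, key=lambda s: s.start)


sample = ("Contact j.jansen@example.org, IBAN NL91ABNA0417164300, "
          "BSN 111222333, reference 123456789.")
suggestions = suggest_redactions(sample)
```

In the sample, the nine-digit reference number fails the elfproef and is therefore not suggested, while the e-mail address, the IBAN and the BSN are. Returning suggestions instead of a masked text keeps the final decision with the reviewing employee.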

External provider

Datamask
