Please note: The algorithm descriptions in English have been automatically translated. Errors may have been introduced in this process. For the original descriptions, go to the Dutch version of the Algorithm Register.

Text analysis

Based on language technology, personal and company names are read and filtered out of text files such as emails and individual documents.

Last change on 14th of October 2024, at 10:47 (CET) | Publication Standard 1.0
Publication category
Other algorithms
Impact assessment
Field not filled in.
Status
In use

General information

Theme

Organisation and business operations

Begin date

januari

Contact information

info@waterschaplimburg.nl

Link to publication website

open.waterschaplimburg.nl

Responsible use

Goal and impact

Support in the review process where legal protection applies to information that is disclosed. Protection from the AVG (persons) and Woo laws (especially company confidential), where grounds for exception are named.

Considerations

Manual review is intensive and error-prone. A suggestion list from the entity extraction algorithm captures all conceivable instances of individuals in the text.

Human intervention

Within the software, a list is built and offered to the user to select in the automatic varnishing process. The choice to adopt an advised term as a person name and not to disclose it is up to the user.

Risk management

There is no risk of automated decision-making and the algorithm has no impact on fundamental rights because the algorithm does not make decisions with legal consequences. It only makes a proposal for anonymising personal data. The employee of the administrative body always makes the final check whether a document has been correctly anonymised.

Legal basis

Legislation around public access to government data (Woo)

Links to legal bases

Wet open overheid: https://wetten.overheid.nl/BWBR0045754/2023-04-01#Hoofdstuk5

Operations

Data

This refers to documents and messaging information within central government. Including email, files, Whatsapp messages and other media where administrative decision-making can be found.

Links to data sources

Algemene Office applicaties: Dit betreft standaard Office formaten inclusief email en social media formaten.

Technical design

Texts are recognised on the basis of Named Entity Recognition (NER) and a process within Insights extracts the names for further processing towards the management interface and the automatic lacquer rules.

External provider

ZyLAB eDiscovery & Compliance Services B.V.

Similar algorithm descriptions

  • Based on read-in data and answers provided by the applicant, the algorithm determines whether the applicant is eligible for any of the benefits to be applied for.

    Last change on 24th of June 2024, at 7:02 (CET) | Publication Standard 1.0
    Publication category
    Impactful algorithms
    Impact assessment
    Field not filled in.
    Status
    In use
  • To properly categorise customer signals and extract improvement actions from them, a text mining algorithm is used so that open text fields are converted to categories.

    Last change on 22nd of October 2024, at 9:43 (CET) | Publication Standard 1.0
    Publication category
    Other algorithms
    Impact assessment
    Field not filled in.
    Status
    In use
  • The algorithm recognises (personal) data and otherwise confidential information in a document and makes a proposal to anonymise it. A staff member evaluates the proposal and makes the final adjustment, making the document suitable for publication.

    Last change on 16th of August 2024, at 8:50 (CET) | Publication Standard 1.0
    Publication category
    Other algorithms
    Impact assessment
    DPIA
    Status
    In use
  • The algorithm recognises (personal) data and otherwise confidential information in a document and makes a proposal to anonymise it. A staff member evaluates the proposal and makes the final adjustment, making the document suitable for publication.

    Last change on 7th of October 2024, at 15:33 (CET) | Publication Standard 1.0
    Publication category
    Other algorithms
    Impact assessment
    Field not filled in.
    Status
    In use
  • The algorithm recognises (personal) data and otherwise confidential information in a document and makes a proposal to anonymise it. A staff member evaluates the proposal and makes the final adjustment, making the document suitable for publication.

    Last change on 25th of January 2024, at 12:17 (CET) | Publication Standard 1.0
    Publication category
    Other algorithms
    Impact assessment
    Field not filled in.
    Status
    In use