User Deletion Cleanup Flink Job
The UserDeletionCleanupFunction
job handles the cleanup and deletion of user-related data from various storage systems when users are deleted. It ensures that all related information is removed or masked appropriately in multiple data sources, including Cassandra and Keycloak. Here's what the job does in simpler terms:
Identify User Deletion Events:
The job listens for events indicating that a user is to be deleted.
It retrieves user information from a user organization service to validate the deletion event.
Keycloak User Removal:
The job connects to Keycloak to remove the user credentials.
If removal fails, it masks sensitive information (e.g., email, phone) to ensure user data is not exposed.
Database Operations:
The job connects to Cassandra and updates the user lookup table to remove entries related to the user (like email, phone, external ID, and username).
It updates the main user table in Cassandra to reflect the deletion status by clearing or masking personal information.
It also removes user entries from the external identity table in Cassandra.
The job updates the organization table to mark the user as deleted and record the date of deletion.
Audit Events and Logging:
The job generates audit events to track the cleanup process for telemetry and auditing purposes.
It logs detailed information during the deletion process for troubleshooting and error handling.
In summary, this job ensures that when a user is deleted, all related data is appropriately removed or masked across multiple storage systems, maintaining data privacy and consistency. It plays a crucial role in the cleanup process, ensuring that user data is properly handled upon deletion.
Configuration Variables:
kafka.input.topic
{{env}}.delete.user
Kafka topic from which messages/events are read to be processed
kafka.groupId
{{env}}-delete-user-group
Kafka input topic group Id
user.keyspace
sunbird
Cassandra keyspace name
user.lookup.table
user_lookup
Cassandra table used to store user lookup data
user.table
user
Cassandra table used to store user details
user.externalIdentity.table
usr_external_identity
Cassandra table used to store user extrenal identity details
user.org.table
user_organisation
Cassandra table used to store organisation details
service.lms.basePath
lms base url
service.userorg.basePath
User-Org service URL
sunbird_keycloak_user_federation_provider_id
sunbird_keycloak_user_federation_provider_id
fedaration provider id for key cloak.
user_read_api
API route for fetching user profile details
batch_search_api
API route for fetching batch details
user.ownership.transfer.parallelism
1
Degree of parallelism for the user ownership
Sample event:
Last updated