Notes from June 27th, 2025
Attempt to implement Python 3.10 and Python 3.12 on the node control. TDP lib works, but the visualisation part of TDP lib does not. Conflict with versions when upgrading.
Pull Requests
Weekly review of open PRs (in chronological order):
- tdp-website#155: status of “feat(pages): created tables for stack 2.0” : Draft
- tdp-website#154: status of “feat(reports): contributors report of the June 20th 2025”. PR merged
- tdp-lib#647: status of “Add the tdp vars update command”. PR merged
- tdp-collection#952: status of “Feat/variable ergonomy”. PR merged
- tdp-collection#954: status of “feat(spark): add iceberg-mr jar to conduct operations on tables creat…“. PR merged.
Open Topics
HBase
New compilation of Apache HBase in version 2.6.1. This version upgrade meant that we also had to compile Phoenix Query Server and Phoenix with HBase 2.6.1.
The HBase tests don’t work with Python 3.12, whereas they work with Python 3.9.
Iceberg
When an Iceberg table is created in Hive and you want to delete it in Spark, you get error 953: Hive Iceberg storage handler is missing. You can still insert and read data in the table. We have found that adding iceberg.mr jar to Spark solves this error. The basic error probably comes from the CASCADE keyword. Investigation ongoing.
Ansible
Ansible 2.18 does not work to deploy TDP. Python 3 dnf is not available in Python 3.12. In addition, Rocky Linux 8 uses Python 3.6, so you will need to use a version of Ansible compatible with Python 3.6, as is the case for version 2.16 of Ansible.