Notes from June 6th, 2025
Abandon of building TDP2 with a stack based on Hadoop 3.4. TDP2 will use Hadoop 3.3.6 instead. Discussions around the TDP CLI commands.
Pull Requests
Weekly review of open PRs (in chronological order):
- tdp-website#151: status of “feat(reports): contributors report of the May 23rd 2025”. PR merged.
- tdp-lib#647: status of “Add the tdp vars import command”. Ongoing investigations.
- tdp-collection#950: status of “feat(zookeeper): allow specific 4lw commands via whitelist”. PR merged.
Open Topics
TDP 2
Using Hadoop 3.4.2 for TDP2 has been abandoned, as it was too complex to adapt in the TDP context and too costly in terms of time, without any results. TDP2 will use Hadoop 3.3.6 instead, along with Hive 4.0.0 including backported commits.
tdp-lib
A discussion about the TDP CLI commands led to the following changes:
- Rename the
tdp vars importcommand totdp vars update - Integrate the behavior of the
--eraseoption by default, and remove it from the options list default_varsshould be taken into account by default- Remove the
tdp vars editcommand - Refactor the
tdp initcommand:- Change the
--overridesoption with--conf - Allow the
--confoption to be specified multiple times (multiple = true) - The
confmust be in the current directory; display a warning if it does not exist or does not contain any.ymlfile. - If the
--confoption points to a Git repository: check the SHA and display a warning if the repository is in a dirty state - If
tdp_vars_defaultis from Galaxy: use the version
- Change the
- For the
tdp vars initcommand : create 1 commit pertdp_vars_defaultand--conf - For the
tdp vars updatecommand: create only 1 commit for all changes - List all the existing variables in the
.envfile - Hide the
tdp_varsfolder by calling it.__tdp_vars - update the
tdp-varsplugin to remove the default variable merging behavior