clemlabprojects / ambari

Fork of Apache Ambari maintained by Clemlab Company
https://www.clemlab.com
Apache License 2.0
44 stars 19 forks source link

Error during cluster installation using source repo 1.2.2.0, python3, and Ubuntu 22.04 #97

Open simon4477 opened 1 week ago

simon4477 commented 1 week ago

I have installed ambari-server and agent following the official documentation. Hosts are registered and all hosts checks are passed. Repository URL's are also validated. I encountered errors during cluster installation. Here are the details: Ambari Repo:

https://archive.clemlab.com/centos7-python3/ambari-release/2.7.9.0.0-110/ambari.repo

Stack Repo:

ODP-1.2:           https://clemlabs.s3.eu-west-3.amazonaws.com/ubuntu22/odp-release/1.2.2.0-46/ODP
ODP-UTILS-1.2.2.0: https://clemlabs.s3.eu-west-3.amazonaws.com/ubuntu22/odp-utils/1.2.2.0/ODP-UTILS

Error during installing DataNode:

stderr: 
self.save_component_version_to_structured_out(self.command_name)
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/script/script.py", line 223, in save_component_version_to_structured_out
    stack_select_package_name = stack_select.get_package_name()
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/functions/stack_select.py", line 116, in get_package_name
    package = get_packages(PACKAGE_SCOPE_STACK_SELECT, service_name, component_name)
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/functions/stack_select.py", line 230, in get_packages
    supported_packages = get_supported_packages()
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/functions/stack_select.py", line 154, in get_supported_packages
    raise Fail("Unable to query for supported packages using {0}".format(stack_selector_path))
Fail: Unable to query for supported packages using /usr/bin/odp-select
 stdout:
raise Fail("Unable to query for supported packages using {0}".format(stack_selector_path))
Fail: Unable to query for supported packages using /usr/bin/odp-select
2024-11-10 08:09:58,751 - Using hadoop conf dir: /usr/odp/current/hadoop-client/conf
2024-11-10 08:09:58,754 - Stack Feature Version Info: Cluster Stack=1.2, Command Stack=None, Command Version=None -> 1.2
2024-11-10 08:09:58,764 - Using hadoop conf dir: /usr/odp/current/hadoop-client/conf
2024-11-10 08:09:58,774 - Command repositories: ODP-1.2-repo-1, ODP-UTILS-1.2.2.0-repo-1
2024-11-10 08:09:58,775 - Applicable repositories: ODP-1.2-repo-1, ODP-UTILS-1.2.2.0-repo-1
2024-11-10 08:10:06,376 - Looking for matching packages in the following repositories: clemlabs.s3.eu-west-3.amazonaws.com_ubuntu22_odp-release_1.2.2.0-46_ODP, clemlabs.s3.eu-west-3.amazonaws.com_ubuntu22_odp-utils_1.2.2.0_ODP-UTILS
2024-11-10 08:10:06,377 - Package['hadoop-1-2-2-0-46-client'] {'retry_on_repo_unavailability': False, 'retry_count': 5}
2024-11-10 08:10:06,392 - Installing package hadoop-1-2-2-0-46-client ('/usr/bin/apt-get -o Dpkg::Options::=--force-confdef --allow-unauthenticated --assume-yes install hadoop-1-2-2-0-46-client')

I changed the stack repository URLs according to the [official documentation]():

ODP-1.2:           https://archive.clemlab.com/ubuntu22-python3/odp-release/1.2.2.0-138/ODP
ODP-UTILS-1.2.2.0: https://archive.clemlab.com/ubuntu22-python3/odp-utils/1.2.2.0/ODP-UTILS

The ODP-UTILS URL given in the documentation https://archive.clemlab.com/ubuntu22-python3/odp-utils/1.2.2.0-138/ODP-UTILS does not get validated. when i change 1.2.2.0-138 to 1.2.2.0 then it gets validated. But i get the following Errors during installing DataNode:

stderr: 
2024-11-10 08:32:23,792 - Reporting component version failed
Traceback (most recent call last):
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/script/script.py", line 367, in execute
    self.save_component_version_to_structured_out(self.command_name)
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/script/script.py", line 223, in save_component_version_to_structured_out
    stack_select_package_name = stack_select.get_package_name()
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/functions/stack_select.py", line 116, in get_package_name
    package = get_packages(PACKAGE_SCOPE_STACK_SELECT, service_name, component_name)
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/functions/stack_select.py", line 230, in get_packages
    supported_packages = get_supported_packages()
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/functions/stack_select.py", line 154, in get_supported_packages
    raise Fail("Unable to query for supported packages using {0}".format(stack_selector_path))
Fail: Unable to query for supported packages using /usr/bin/odp-select
2024-11-10 08:33:06,752 - Reporting component version failed
Traceback (most recent call last):
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/script/script.py", line 367, in execute
    self.save_component_version_to_structured_out(self.command_name)
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/script/script.py", line 223, in save_component_version_to_structured_out
    stack_select_package_name = stack_select.get_package_name()
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/functions/stack_select.py", line 116, in get_package_name
    package = get_packages(PACKAGE_SCOPE_STACK_SELECT, service_name, component_name)
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/functions/stack_select.py", line 230, in get_packages
    supported_packages = get_supported_packages()
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/functions/stack_select.py", line 154, in get_supported_packages
    raise Fail("Unable to query for supported packages using {0}".format(stack_selector_path))
Fail: Unable to query for supported packages using /usr/bin/odp-select
Traceback (most recent call last):
  File "/var/lib/ambari-agent/cache/stacks/ODP/1.0/services/HDFS/package/scripts/datanode.py", line 126, in <module>
    DataNode().execute()
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/script/script.py", line 352, in execute
    method(env)
  File "/var/lib/ambari-agent/cache/stacks/ODP/1.0/services/HDFS/package/scripts/datanode.py", line 45, in install
    self.install_packages(env)
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/script/script.py", line 853, in install_packages
    retry_count=agent_stack_retry_count)
  File "/usr/lib/ambari-agent/lib/resource_management/core/base.py", line 166, in __init__
    self.env.run()
  File "/usr/lib/ambari-agent/lib/resource_management/core/environment.py", line 160, in run
    self.run_action(resource, action)
  File "/usr/lib/ambari-agent/lib/resource_management/core/environment.py", line 124, in run_action
    provider_action()
  File "/usr/lib/ambari-agent/lib/resource_management/core/providers/packaging.py", line 30, in action_install
    self._pkg_manager.install_package(package_name, self.__create_context())
  File "/usr/lib/ambari-agent/lib/ambari_commons/repo_manager/apt_manager.py", line 35, in wrapper
    return function_to_decorate(self, name, *args[2:], **kwargs)
  File "/usr/lib/ambari-agent/lib/ambari_commons/repo_manager/apt_manager.py", line 282, in install_package
    shell.repository_manager_executor(cmd, self.properties, context, env=self.properties.install_cmd_env)
  File "/usr/lib/ambari-agent/lib/ambari_commons/shell.py", line 753, in repository_manager_executor
    raise RuntimeError(message)
RuntimeError: Failed to execute command '/usr/bin/apt-get -o Dpkg::Options::=--force-confdef --allow-unauthenticated --assume-yes install hadoop-1-2-2-0-138-client', exited with code '100', message: 'E: dpkg was interrupted, you must manually run 'dpkg --configure -a' to correct the problem. 
'
 stdout:
2024-11-10 08:32:20,708 - Stack Feature Version Info: Cluster Stack=1.2, Command Stack=None, Command Version=None -> 1.2
2024-11-10 08:32:20,710 - Using hadoop conf dir: /usr/odp/current/hadoop-client/conf
2024-11-10 08:32:20,711 - Group['livy'] {}
2024-11-10 08:32:20,711 - Group['spark'] {}
2024-11-10 08:32:20,712 - Group['hdfs'] {}
2024-11-10 08:32:20,712 - Group['hadoop'] {}
2024-11-10 08:32:20,712 - Group['users'] {}
2024-11-10 08:32:20,712 - User['hive'] {'gid': 'hadoop', 'fetch_nonlocal_groups': True, 'groups': ['hadoop'], 'uid': None}
2024-11-10 08:32:20,713 - User['yarn-ats'] {'gid': 'hadoop', 'fetch_nonlocal_groups': True, 'groups': ['hadoop'], 'uid': None}
2024-11-10 08:32:20,715 - User['livy'] {'gid': 'hadoop', 'fetch_nonlocal_groups': True, 'groups': ['livy', 'hadoop'], 'uid': None}
2024-11-10 08:32:20,716 - User['zookeeper'] {'gid': 'hadoop', 'fetch_nonlocal_groups': True, 'groups': ['hadoop'], 'uid': None}
2024-11-10 08:32:20,717 - User['spark'] {'gid': 'hadoop', 'fetch_nonlocal_groups': True, 'groups': ['spark', 'hadoop'], 'uid': None}
2024-11-10 08:32:20,718 - User['ambari-qa'] {'gid': 'hadoop', 'fetch_nonlocal_groups': True, 'groups': ['hadoop', 'users'], 'uid': None}
2024-11-10 08:32:20,719 - User['kafka'] {'gid': 'hadoop', 'fetch_nonlocal_groups': True, 'groups': ['hadoop'], 'uid': None}
2024-11-10 08:32:20,720 - User['tez'] {'gid': 'hadoop', 'fetch_nonlocal_groups': True, 'groups': ['hadoop', 'users'], 'uid': None}
2024-11-10 08:32:20,720 - User['hdfs'] {'gid': 'hadoop', 'fetch_nonlocal_groups': True, 'groups': ['hdfs', 'hadoop'], 'uid': None}
2024-11-10 08:32:20,722 - User['yarn'] {'gid': 'hadoop', 'fetch_nonlocal_groups': True, 'groups': ['hadoop'], 'uid': None}
2024-11-10 08:32:20,722 - User['mapred'] {'gid': 'hadoop', 'fetch_nonlocal_groups': True, 'groups': ['hadoop'], 'uid': None}
2024-11-10 08:32:20,723 - File['/var/lib/ambari-agent/tmp/changeUid.sh'] {'content': StaticFile('changeToSecureUid.sh'), 'mode': 0555}
2024-11-10 08:32:20,724 - Execute['/var/lib/ambari-agent/tmp/changeUid.sh ambari-qa /tmp/hadoop-ambari-qa,/tmp/hsperfdata_ambari-qa,/home/ambari-qa,/tmp/ambari-qa,/tmp/sqoop-ambari-qa 0'] {'not_if': '(test $(id -u ambari-qa) -gt 1000) || (false)'}
2024-11-10 08:32:20,729 - Skipping Execute['/var/lib/ambari-agent/tmp/changeUid.sh ambari-qa /tmp/hadoop-ambari-qa,/tmp/hsperfdata_ambari-qa,/home/ambari-qa,/tmp/ambari-qa,/tmp/sqoop-ambari-qa 0'] due to not_if
2024-11-10 08:32:20,729 - Group['hdfs'] {}
2024-11-10 08:32:20,729 - User['hdfs'] {'fetch_nonlocal_groups': True, 'groups': ['hdfs', 'hadoop', u'hdfs']}
2024-11-10 08:32:20,730 - FS Type: HDFS
2024-11-10 08:32:20,730 - Directory['/etc/hadoop'] {'mode': 0755}
2024-11-10 08:32:20,731 - Directory['/var/lib/ambari-agent/tmp/hadoop_java_io_tmpdir'] {'owner': 'hdfs', 'group': 'hadoop', 'mode': 01777}
2024-11-10 08:32:20,740 - Repository['ODP-1.2-repo-51'] {'base_url': 'https://archive.clemlab.com/ubuntu22-python3/odp-release/1.2.2.0-138/ODP', 'action': ['prepare'], 'components': [u'ODP', 'main'], 'repo_template': '{{package_type}} {{base_url}} {{components}}', 'repo_file_name': 'ambari-odp-51', 'mirror_list': None}
2024-11-10 08:32:20,743 - Repository['ODP-UTILS-1.2.2.0-repo-51'] {'base_url': 'https://archive.clemlab.com/ubuntu22-python3/odp-utils/1.2.2.0/ODP-UTILS', 'action': ['prepare'], 'components': [u'ODP-UTILS', 'main'], 'repo_template': '{{package_type}} {{base_url}} {{components}}', 'repo_file_name': 'ambari-odp-51', 'mirror_list': None}
2024-11-10 08:32:20,744 - Repository[None] {'action': ['create']}
2024-11-10 08:32:20,745 - File['/tmp/tmpXWEIQA'] {'content': 'deb https://archive.clemlab.com/ubuntu22-python3/odp-release/1.2.2.0-138/ODP ODP main\ndeb https://archive.clemlab.com/ubuntu22-python3/odp-utils/1.2.2.0/ODP-UTILS ODP-UTILS main'}
2024-11-10 08:32:20,746 - Writing File['/tmp/tmpXWEIQA'] because contents don't match
2024-11-10 08:32:20,746 - Rewriting /etc/apt/sources.list.d/ambari-odp-51.list since it has changed.
2024-11-10 08:32:20,746 - File['/etc/apt/sources.list.d/ambari-odp-51.list'] {'content': StaticFile('/tmp/tmpXWEIQA')}
2024-11-10 08:32:20,746 - Writing File['/etc/apt/sources.list.d/ambari-odp-51.list'] because it doesn't exist
2024-11-10 08:32:20,747 - call[['apt-get', 'update', '-qq', '-o', u'Dir::Etc::sourcelist=sources.list.d/ambari-odp-51.list', '-o', 'Dir::Etc::sourceparts=-', '-o', 'APT::Get::List-Cleanup=0']] {'sudo': True, 'quiet': False}
2024-11-10 08:32:23,719 - call returned (0, 'W: https://archive.clemlab.com/ubuntu22-python3/odp-release/1.2.2.0-138/ODP/dists/ODP/InRelease: Key is stored in legacy trusted.gpg keyring (/etc/apt/trusted.gpg), see the DEPRECATION section in apt-key(8) for details.\nW: https://archive.clemlab.com/ubuntu22-python3/odp-utils/1.2.2.0/ODP-UTILS/dists/ODP-UTILS/InRelease: Key is stored in legacy trusted.gpg keyring (/etc/apt/trusted.gpg), see the DEPRECATION section in apt-key(8) for details.')
2024-11-10 08:32:23,720 - Package['unzip'] {'retry_on_repo_unavailability': False, 'retry_count': 5}
2024-11-10 08:32:23,760 - Skipping installation of existing package unzip
2024-11-10 08:32:23,760 - Package['curl'] {'retry_on_repo_unavailability': False, 'retry_count': 5}
2024-11-10 08:32:23,777 - Skipping installation of existing package curl
2024-11-10 08:32:23,792 - Reporting component version failed
Traceback (most recent call last):
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/script/script.py", line 367, in execute
    self.save_component_version_to_structured_out(self.command_name)
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/script/script.py", line 223, in save_component_version_to_structured_out
    stack_select_package_name = stack_select.get_package_name()
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/functions/stack_select.py", line 116, in get_package_name
    package = get_packages(PACKAGE_SCOPE_STACK_SELECT, service_name, component_name)
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/functions/stack_select.py", line 230, in get_packages
    supported_packages = get_supported_packages()
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/functions/stack_select.py", line 154, in get_supported_packages
    raise Fail("Unable to query for supported packages using {0}".format(stack_selector_path))
Fail: Unable to query for supported packages using /usr/bin/odp-select
2024-11-10 08:32:23,908 - Using hadoop conf dir: /usr/odp/current/hadoop-client/conf
2024-11-10 08:32:23,909 - Stack Feature Version Info: Cluster Stack=1.2, Command Stack=None, Command Version=None -> 1.2
2024-11-10 08:32:23,919 - Using hadoop conf dir: /usr/odp/current/hadoop-client/conf
2024-11-10 08:32:23,930 - Command repositories: ODP-1.2-repo-51, ODP-UTILS-1.2.2.0-repo-51
2024-11-10 08:32:23,930 - Applicable repositories: ODP-1.2-repo-51, ODP-UTILS-1.2.2.0-repo-51
2024-11-10 08:32:32,390 - Looking for matching packages in the following repositories: archive.clemlab.com_ubuntu22-python3_odp-release_1.2.2.0-138_ODP, archive.clemlab.com_ubuntu22-python3_odp-utils_1.2.2.0_ODP-UTILS
2024-11-10 08:32:32,392 - Package['hadoop-1-2-2-0-138-client'] {'retry_on_repo_unavailability': False, 'retry_count': 5}
2024-11-10 08:32:32,428 - Installing package hadoop-1-2-2-0-138-client ('/usr/bin/apt-get -o Dpkg::Options::=--force-confdef --allow-unauthenticated --assume-yes install hadoop-1-2-2-0-138-client')
2024-11-10 08:33:06,752 - Reporting component version failed
Traceback (most recent call last):
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/script/script.py", line 367, in execute
    self.save_component_version_to_structured_out(self.command_name)
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/script/script.py", line 223, in save_component_version_to_structured_out
    stack_select_package_name = stack_select.get_package_name()
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/functions/stack_select.py", line 116, in get_package_name
    package = get_packages(PACKAGE_SCOPE_STACK_SELECT, service_name, component_name)
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/functions/stack_select.py", line 230, in get_packages
    supported_packages = get_supported_packages()
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/functions/stack_select.py", line 154, in get_supported_packages
    raise Fail("Unable to query for supported packages using {0}".format(stack_selector_path))
Fail: Unable to query for supported packages using /usr/bin/odp-select

Command failed after 1 tries

I manually uninstall all the packages, rebooted all machines, restarted all services, retired, but got the same errors.

apt clean && apt autoremove
dpkg --configure -a

I changed the stack repo back to defaults:

ODP-1.2:           https://clemlabs.s3.eu-west-3.amazonaws.com/ubuntu22/odp-release/1.2.2.0-46/ODP
ODP-UTILS-1.2.2.0: https://clemlabs.s3.eu-west-3.amazonaws.com/ubuntu22/odp-utils/1.2.2.0/ODP-UTILS

Errors during installing DataNode:

stderr: 
Traceback (most recent call last):
  File "/var/lib/ambari-agent/cache/stacks/ODP/1.0/services/HDFS/package/scripts/datanode.py", line 126, in <module>
    DataNode().execute()
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/script/script.py", line 352, in execute
    method(env)
  File "/var/lib/ambari-agent/cache/stacks/ODP/1.0/services/HDFS/package/scripts/datanode.py", line 45, in install
    self.install_packages(env)
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/script/script.py", line 853, in install_packages
    retry_count=agent_stack_retry_count)
  File "/usr/lib/ambari-agent/lib/resource_management/core/base.py", line 166, in __init__
    self.env.run()
  File "/usr/lib/ambari-agent/lib/resource_management/core/environment.py", line 160, in run
    self.run_action(resource, action)
  File "/usr/lib/ambari-agent/lib/resource_management/core/environment.py", line 124, in run_action
    provider_action()
  File "/usr/lib/ambari-agent/lib/resource_management/core/providers/packaging.py", line 30, in action_install
    self._pkg_manager.install_package(package_name, self.__create_context())
  File "/usr/lib/ambari-agent/lib/ambari_commons/repo_manager/apt_manager.py", line 35, in wrapper
    return function_to_decorate(self, name, *args[2:], **kwargs)
  File "/usr/lib/ambari-agent/lib/ambari_commons/repo_manager/apt_manager.py", line 282, in install_package
    shell.repository_manager_executor(cmd, self.properties, context, env=self.properties.install_cmd_env)
  File "/usr/lib/ambari-agent/lib/ambari_commons/shell.py", line 753, in repository_manager_executor
    raise RuntimeError(message)
RuntimeError: Failed to execute command '/usr/bin/apt-get -o Dpkg::Options::=--force-confdef --allow-unauthenticated --assume-yes install hadoop-1-2-2-0-138-client', exited with code '100', message: 'No apport report written because the error message indicates its a followup error from a previous failure.

No apport report written because MaxReports is reached already

No apport report written because MaxReports is reached already

No apport report written because MaxReports is reached already

No apport report written because MaxReports is reached already

No apport report written because MaxReports is reached already

No apport report written because MaxReports is reached already

E: Sub-process /usr/bin/dpkg returned an error code (1)
'
 stdout:
2024-11-10 09:36:19,785 - Stack Feature Version Info: Cluster Stack=1.2, Command Stack=None, Command Version=None -> 1.2
2024-11-10 09:36:19,788 - Using hadoop conf dir: /usr/odp/current/hadoop-client/conf
2024-11-10 09:36:19,788 - Group['livy'] {}
2024-11-10 09:36:19,789 - Group['spark'] {}
2024-11-10 09:36:19,790 - Group['hdfs'] {}
2024-11-10 09:36:19,790 - Group['hadoop'] {}
2024-11-10 09:36:19,790 - Group['users'] {}
2024-11-10 09:36:19,790 - User['hive'] {'gid': 'hadoop', 'fetch_nonlocal_groups': True, 'groups': ['hadoop'], 'uid': None}
2024-11-10 09:36:19,792 - User['yarn-ats'] {'gid': 'hadoop', 'fetch_nonlocal_groups': True, 'groups': ['hadoop'], 'uid': None}
2024-11-10 09:36:19,793 - User['livy'] {'gid': 'hadoop', 'fetch_nonlocal_groups': True, 'groups': ['livy', 'hadoop'], 'uid': None}
2024-11-10 09:36:19,794 - User['zookeeper'] {'gid': 'hadoop', 'fetch_nonlocal_groups': True, 'groups': ['hadoop'], 'uid': None}
2024-11-10 09:36:19,795 - User['spark'] {'gid': 'hadoop', 'fetch_nonlocal_groups': True, 'groups': ['spark', 'hadoop'], 'uid': None}
2024-11-10 09:36:19,796 - User['ambari-qa'] {'gid': 'hadoop', 'fetch_nonlocal_groups': True, 'groups': ['hadoop', 'users'], 'uid': None}
2024-11-10 09:36:19,797 - User['kafka'] {'gid': 'hadoop', 'fetch_nonlocal_groups': True, 'groups': ['hadoop'], 'uid': None}
2024-11-10 09:36:19,798 - User['tez'] {'gid': 'hadoop', 'fetch_nonlocal_groups': True, 'groups': ['hadoop', 'users'], 'uid': None}
2024-11-10 09:36:19,799 - User['hdfs'] {'gid': 'hadoop', 'fetch_nonlocal_groups': True, 'groups': ['hdfs', 'hadoop'], 'uid': None}
2024-11-10 09:36:19,800 - User['yarn'] {'gid': 'hadoop', 'fetch_nonlocal_groups': True, 'groups': ['hadoop'], 'uid': None}
2024-11-10 09:36:19,801 - User['mapred'] {'gid': 'hadoop', 'fetch_nonlocal_groups': True, 'groups': ['hadoop'], 'uid': None}
2024-11-10 09:36:19,802 - File['/var/lib/ambari-agent/tmp/changeUid.sh'] {'content': StaticFile('changeToSecureUid.sh'), 'mode': 0555}
2024-11-10 09:36:19,804 - Execute['/var/lib/ambari-agent/tmp/changeUid.sh ambari-qa /tmp/hadoop-ambari-qa,/tmp/hsperfdata_ambari-qa,/home/ambari-qa,/tmp/ambari-qa,/tmp/sqoop-ambari-qa 0'] {'not_if': '(test $(id -u ambari-qa) -gt 1000) || (false)'}
2024-11-10 09:36:19,808 - Skipping Execute['/var/lib/ambari-agent/tmp/changeUid.sh ambari-qa /tmp/hadoop-ambari-qa,/tmp/hsperfdata_ambari-qa,/home/ambari-qa,/tmp/ambari-qa,/tmp/sqoop-ambari-qa 0'] due to not_if
2024-11-10 09:36:19,808 - Group['hdfs'] {}
2024-11-10 09:36:19,809 - User['hdfs'] {'fetch_nonlocal_groups': True, 'groups': ['hdfs', 'hadoop', u'hdfs']}
2024-11-10 09:36:19,810 - FS Type: HDFS
2024-11-10 09:36:19,810 - Directory['/etc/hadoop'] {'mode': 0755}
2024-11-10 09:36:19,819 - File['/usr/odp/current/hadoop-client/conf/hadoop-env.sh'] {'content': InlineTemplate(...), 'owner': 'hdfs', 'group': 'hadoop'}
2024-11-10 09:36:19,820 - Directory['/var/lib/ambari-agent/tmp/hadoop_java_io_tmpdir'] {'owner': 'hdfs', 'group': 'hadoop', 'mode': 01777}
2024-11-10 09:36:19,831 - Repository['ODP-1.2-repo-151'] {'base_url': 'https://archive.clemlab.com/ubuntu22-python3/odp-release/1.2.2.0-138/ODP', 'action': ['prepare'], 'components': [u'ODP', 'main'], 'repo_template': '{{package_type}} {{base_url}} {{components}}', 'repo_file_name': 'ambari-odp-151', 'mirror_list': None}
2024-11-10 09:36:19,835 - Repository['ODP-UTILS-1.2.2.0-repo-151'] {'base_url': 'https://archive.clemlab.com/ubuntu22-python3/odp-utils/1.2.2.0/ODP-UTILS', 'action': ['prepare'], 'components': [u'ODP-UTILS', 'main'], 'repo_template': '{{package_type}} {{base_url}} {{components}}', 'repo_file_name': 'ambari-odp-151', 'mirror_list': None}
2024-11-10 09:36:19,835 - Repository[None] {'action': ['create']}
2024-11-10 09:36:19,836 - File['/tmp/tmpyeYtPf'] {'content': 'deb https://archive.clemlab.com/ubuntu22-python3/odp-release/1.2.2.0-138/ODP ODP main\ndeb https://archive.clemlab.com/ubuntu22-python3/odp-utils/1.2.2.0/ODP-UTILS ODP-UTILS main'}
2024-11-10 09:36:19,837 - Writing File['/tmp/tmpyeYtPf'] because contents don't match
2024-11-10 09:36:19,837 - Rewriting /etc/apt/sources.list.d/ambari-odp-151.list since it has changed.
2024-11-10 09:36:19,837 - File['/etc/apt/sources.list.d/ambari-odp-151.list'] {'content': StaticFile('/tmp/tmpyeYtPf')}
2024-11-10 09:36:19,837 - Writing File['/etc/apt/sources.list.d/ambari-odp-151.list'] because it doesn't exist
2024-11-10 09:36:19,838 - call[['apt-get', 'update', '-qq', '-o', u'Dir::Etc::sourcelist=sources.list.d/ambari-odp-151.list', '-o', 'Dir::Etc::sourceparts=-', '-o', 'APT::Get::List-Cleanup=0']] {'sudo': True, 'quiet': False}
2024-11-10 09:36:21,579 - call returned (0, 'W: https://archive.clemlab.com/ubuntu22-python3/odp-release/1.2.2.0-138/ODP/dists/ODP/InRelease: Key is stored in legacy trusted.gpg keyring (/etc/apt/trusted.gpg), see the DEPRECATION section in apt-key(8) for details.\nW: https://archive.clemlab.com/ubuntu22-python3/odp-utils/1.2.2.0/ODP-UTILS/dists/ODP-UTILS/InRelease: Key is stored in legacy trusted.gpg keyring (/etc/apt/trusted.gpg), see the DEPRECATION section in apt-key(8) for details.')
2024-11-10 09:36:21,579 - Package['unzip'] {'retry_on_repo_unavailability': False, 'retry_count': 5}
2024-11-10 09:36:21,619 - Skipping installation of existing package unzip
2024-11-10 09:36:21,619 - Package['curl'] {'retry_on_repo_unavailability': False, 'retry_count': 5}
2024-11-10 09:36:21,641 - Skipping installation of existing package curl
2024-11-10 09:36:21,779 - Using hadoop conf dir: /usr/odp/current/hadoop-client/conf
2024-11-10 09:36:21,780 - Stack Feature Version Info: Cluster Stack=1.2, Command Stack=None, Command Version=None -> 1.2
2024-11-10 09:36:21,788 - Using hadoop conf dir: /usr/odp/current/hadoop-client/conf
2024-11-10 09:36:21,799 - Command repositories: ODP-1.2-repo-151, ODP-UTILS-1.2.2.0-repo-151
2024-11-10 09:36:21,799 - Applicable repositories: ODP-1.2-repo-151, ODP-UTILS-1.2.2.0-repo-151
2024-11-10 09:36:32,329 - Looking for matching packages in the following repositories: archive.clemlab.com_ubuntu22-python3_odp-release_1.2.2.0-138_ODP, archive.clemlab.com_ubuntu22-python3_odp-utils_1.2.2.0_ODP-UTILS
2024-11-10 09:36:32,330 - Package['hadoop-1-2-2-0-138-client'] {'retry_on_repo_unavailability': False, 'retry_count': 5}
2024-11-10 09:36:32,342 - Installing package hadoop-1-2-2-0-138-client ('/usr/bin/apt-get -o Dpkg::Options::=--force-confdef --allow-unauthenticated --assume-yes install hadoop-1-2-2-0-138-client')

Command failed after 1 tries
lucasbak commented 1 week ago

Hi @simon4477,

Thank you for your issue Indeed there is a wrong doc on the ODP-UTILS link it is 1.2.2.0 instead of 1.2.2.0-138

What is the exact error ?

simon4477 commented 1 week ago

@lucasbak at the stack repository settings page, Ambari doesn't show any error. It just displays a message that the ODP-UTILS repository URL is not validated. I changed 1.2.2.0-138 to 1.2.2.0 and then it gets validated successfully.

The errors come when deploying the cluster, its installation errors. The exact installation errors I already listed.

The default stack repo URL's in Ambari are:

ODP-1.2:           https://clemlabs.s3.eu-west-3.amazonaws.com/ubuntu22/odp-release/1.2.2.0-46/ODP
ODP-UTILS-1.2.2.0: https://clemlabs.s3.eu-west-3.amazonaws.com/ubuntu22/odp-utils/1.2.2.0/ODP-UTILS

But the documentation lists these URLs:

ODP-1.2:           https://archive.clemlab.com/ubuntu22-python3/odp-release/1.2.2.0-138/ODP
ODP-UTILS-1.2.2.0: https://archive.clemlab.com/ubuntu22-python3/odp-utils/1.2.2.0-138/ODP-UTILS

OfCourse the -138 is an error in ODP-Utils URL.

Question: Which stack repo URLs should I use? The Ambari default ones or the ones in the documentation?

Using either stack repository (Ambari default and docs) I get errors during cluster deployment (at the Installing DataNode phase which is the first one always), which I listed above.

lucasbak commented 1 week ago

Please find below the correct VDF file for ODP 1.2.2.0-138:

<repository-version xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:noNamespaceSchemaLocation="version_definition.xsd">
  <release>
    <type>STANDARD</type>
    <stack-id>ODP-1.2</stack-id>
    <version>1.2.2.0</version>
    <build>138</build>
    <compatible-with>1\.\2\.\d+\.\d+</compatible-with>
    <release-notes>http://example.com</release-notes>
    <display>ODP-1.2.2.0-138</display>
  </release>
  <manifest>
    <service id="ACCUMULO-170" name="ACCUMULO" version="1.7.0"/>
    <service id="ATLAS-230" name="ATLAS" version="2.3.0"/>
    <service id="FLINK-1170" name="FLINK" version="1.17.0"/>
    <service id="HBASE-256" name="HBASE" version="2.5.6"/>
    <service id="HDFS-336" name="HDFS" version="3.3.6"/>
    <service id="HIVE-313" name="HIVE" version="3.1.3"/>
    <service id="KNOX-200" name="KNOX" version="2.0.0"/>
    <service id="KAFKA-258" name="KAFKA" version="2.5.8"/>
    <service id="MAPREDUCE2-336" name="MAPREDUCE2" version="3.3.6"/>
    <service id="NIFI-1240" name="NIFI" version="1.24.0"/>
    <service id="NIFI_REGISTRY-080" name="NIFI_REGISTRY" version="0.8.0"/>
    <service id="OOZIE-521" name="OOZIE" version="5.2.1"/>
    <service id="OZONE-130" name="OZONE" version="1.3.0"/>
    <service id="PIG-0160" name="PIG" version="0.16.0"/>
    <service id="RANGER-240" name="RANGER" version="2.4.0"/>
    <service id="RANGER_KMS-240" name="RANGER_KMS" version="2.4.0"/>
    <service id="SQOOP-147" name="SQOOP" version="1.4.7"/>
    <service id="SPARK2-248" name="SPARK2" version="2.4.8"/>
    <service id="SPARK3-342" name="SPARK3" version="3.4.2"/>
    <service id="STORM-121" name="STORM" version="1.2.1"/>
    <service id="TEZ-0102" name="TEZ" version="0.10.2"/>
    <service id="YARN-336" name="YARN" version="3.3.6"/>
    <service id="ZEPPELIN-0101" name="NIFI_REGISTRY" version="0.10.1"/>
    <service id="ZOOKEEPER-383" name="ZOOKEEPER" version="3.8.3"/>
  </manifest>
  <available-services/>
  <repository-info>
    <os family="ubuntu22">
      <package-version>1_2_2_0_*</package-version>
      <repo>
        <baseurl>https://clemlabs.s3.eu-west-3.amazonaws.com/ubuntu22/odp-release/1.2.2.0-138/ODP</baseurl>
        <repoid>ODP-1.2</repoid>
        <reponame>ODP</reponame>
        <unique>true</unique>
      </repo>
      <repo>
        <baseurl>https://clemlabs.s3.eu-west-3.amazonaws.com/ubuntu22/odp-utils/1.2.2.0/ODP-UTILS</baseurl>
        <repoid>ODP-UTILS-1.2.2.0</repoid>
        <reponame>ODP-UTILS</reponame>
        <unique>false</unique>
      </repo>
    </os>
</repository-info>
</repository-version>

We are currently working on the next version and will fix the link on the website

Regards

simon4477 commented 1 week ago

Thank you @lucasbak. How do I use it with the public repository? Can I specify the VDF in Ambari?

Also, is the documentation opensource? I want to contribute to the documentation.

simon4477 commented 1 day ago

@lucasbak is the official documentation for Ubuntu 22.04 up to date? Has anyone ever succeeded deploying on Ubuntu 22.04? I have tried many times but never succeeded in deploying the cluster. There are countless errors in services installation during deployment.

lucasbak commented 1 day ago

Hi @simon4477,

Concerning the documentation, we are checking internally if and how to make it publicly available for PR.

ODP 1.2.2.0-138 is installed in production cluster on Ubuntu 22.04 with many customers so you should be able to deploy it. The difference with your install is that it is using python2. FYI we install it using Ansible https://github.com/clemlabprojects/ansible-odp-cluster-installation.git

What are the problem encountered ?

We have identified multiple problems on the python 3 version of ODP 1.2.2.0 but are fixed in ODP 1.2.4.0 which is about to be released soon (ODP > 1.2.4.0. and ambari > 2.7.11.0 aill be python3 only)

simon4477 commented 1 day ago

Hi @lucasbak

I am deploying the cluster using Ambari web UI. I am using python3 1.2.2.0-138.

Ambari:    https://archive.clemlab.com/ubuntu22-python3/ambari-release/2.7.9.0.0-110/ambari.list
ODP:       https://archive.clemlab.com/ubuntu22-python3/odp-release/1.2.2.0-138/odp.list
ODP-UTILS: https://archive.clemlab.com/ubuntu22-python3/odp-utils/1.2.2.0/ODP-UTILS

As you can see, they are all for python3. I will post the errors I encountered shortly.

Is there a guide/documentation of cluster deployment using Ansible https://github.com/clemlabprojects/ansible-odp-cluster-installation.git. I can't find any documentation about it.

P.S. when is the ODP 1.2.4.0 estimated to be released?