Q1 : Explain the future growth of Apache Ambari?
A : We have seen the massive usage of data analysis which brings huge clusters in place, due to increasing demand for big data technologies like Hadoop. Hence, more visibility companies are leaning towards the technologies like Apache Ambari, for better management of these clusters with enhanced operational efficiency.
In addition, HortonWorks is working on Ambari to make it more scalable. Thus, gaining knowledge of Apache Ambari is an added advantage with Hadoop also.
Q2 : What are the three layers where the Hadoop components are actually supported by Ambari?
A : The three layers that are supported by Ambari are below:
1. Core Hadoop
2. Essential Hadoop
3. Hadoop Support
Q3 : What are the areas where Ambari helps the system administrators to do?
A : With the help of Ambari, system administrators will be able to do the following easily, they are:
1. Provision of Hadoop Cluster
2. Manage a Hadoop cluster
3. Monitor a Hadoop Cluster
Q4 : What is a local repository and when it is useful while using Ambari environment?
A : A local repository is nothing but a hosted space in the local environment. Usually, when the machines don’t have an active internet connection or have restricted or very limited network access a local repository should be set up. With this setup, the user will be able to obtain Ambari and HDP software packages.
Q5 : What is a repository in Apache Ambari?
A : A repository is a hosted space for Ambari software packages for downloading and installing purposes. Apache provides different versions of repositories which are OS specific. Moreover, based on internet accessibility, you can access either of the two formats of repositories:
- Tarball (.tar format, if you don’t have internet access)
- Repo file(.repo format for temporary internet access)
Q6 : How many types of Ambari Repositories are available?
A : The types of Ambari Repositories are listed below:
1. Ambari: This is for Ambari server, Ambari agent and other monitoring software packages
2. HDP: This is used to host Hadoop Stack packages
3. HDP-UTILS: All the utility packages for Ambari and HDP are available
4. EPEL: It stands for “Extra Packages for Enterprise Linux. It has a set of additional packages for the Enterprise Linux
Q7 : Is Ambari Python Client can be used to make good use of Ambari API’s?
A : Yes, Ambari Python client can be used to make good use of Ambari API’s.
Q8 : Name several advantages of Apache Ambari.
A : Here is a list of 8 best Apache Ambari advantages :
- Installation, configuration, and management is way
- Centralized security and application
- Complete visibility to cluster health
- Security
- Metrics visualization and dashboarding
- Customization
- Open-source
- Extensible
Q9 : What are the independent extensions that are contributed to the Ambari codebase?
A : The independent extensions that are contributed to the Ambari Codebase are as follows:
1. Ambari SCOM Management Pack
2. Apache Slider View
Q10 : In which language is the Ambari Shell is developed?
Ans. In Java Language, Ambari shell is developed. Moreover, it is based on Ambari REST client as well as the spring shell framework.
Q11 : What are the different types of Ambari repositories?
A : There are mainly four types of Ambari Repositories as listed below –
- Ambari: This repository is used for the Ambari server, the monitoring software packages, and Ambari agent.
- HDP-UTILS: This repository is used for Ambari and HDP utility packages
- HDP: The repository to host Hadoop Stack packages
- Extra Packages for Enterprise Linux( EPEL): The repository with an additional set of packages for the Enterprise Linux
Q12 : What does Ambari Shell can provide?
A : The Ambari shell can provide an interactive and handy command line tool which actually supports the following:
1. All the available functionality in Ambari Web-app
2. All the context-aware command availability
3. Tab completion
4. Any required parameter support if needed
Q13 : In which language is the Ambari Shell is developed?
A : The shell is developed in Java and it actually based on Ambari REST client and the spring shell framework.
Q14 : How to set up local repository manually?
A : This process is only used when there is no active internet connection is not available. So to set up a local repository, please follow the below steps:
1. First and foremost, set up a host with Apache httpd
2. Next is to download Tarball copy for every repository entire contents
3. Once it is downloaded, one has to extract the contents
Q15 : What is the role of “ambari-qa” user?
A : ‘ambari-qa’ user account performs a service check against cluster services that are created by Ambari on all nodes in the cluster.
Q16 : What is the required action you need to perform if you opt for scheduled maintenance on the cluster nodes?
A : Especially, for all the nodes in the cluster, Ambari offers Maintenance mode option. Hence before performing maintenance, we can enable the maintenance mode of Ambari to avoid alerts.
Q17 : What are the tools that are needed to build Ambari?
A : The following tools are needed to build Ambari:
1. If you are using Mac then you have to download Xcode from the apple store.
2. JDK 7
3. Apache Maven 3.3.9 or later
4. Python 2.6 or later
5. Node JS
6. G++
Q18 : What is the latest version of Ambari that is available in the market and what is the feature that they have added in it?
A : The latest version of Ambari that is available in the market is Ambari 2.5.2. Within, this version they have added a feature called: Cross stack upgrade support.
Q19 : What are the benefits of setting up a local repository?
A : First and foremost by setting up a local repository, you can access Ambari software packages without internet access. Along with that, you can achieve benefits like –
-
- Enhanced governance with better installation performance
- Routine post-installation cluster operations like service start and restart operations
Q20 : What are the tools that are used in Ambari Monitoring?
A : Ambari Monitoring tools actually use two different open source projects for its monitoring purposes,
they are as follows:
- Ganglia
- Nagios
Q21 : What is Apache Ambari?
A : The Apache Ambari is nothing but a project which is solely focused to make life simple while using Hadoop management system. This software helps or provides comfort zone in terms of the following aspect:
1. Provisioning
2. Managing
3. Monitoring Hadoop clusters
4. Provides intuitive interface
5. It is backed up RESTful API’s.
6. Provides an easy to use Hadoop management web UI
Q22 : What are the core benefits for Hadoop users by using Apache Ambari?
A : The Apache Ambari is a great gift for individuals who use Hadoop in their day to day work life. With the use of Ambari, Hadoop users will get the core benefits:
1. The installation process is simplified
2. Configuration and overall management is simplified
3. It has a centralized security setup process
4. It gives out full visibility in terms of Cluster health
5. It is extensively extendable and has an option to customize if needed.
Q23 : How is recovery achieved in Ambari?
A : Recovery happens in Ambari in the following ways:
- Based on actions
In Ambari after a restart master checks for pending actions and reschedules them since every action is persisted here. Also, the master rebuilds the state machines when there is a restart, as the cluster state is persisted in the database. While actions complete master actually crash before recording their completion, when there is a race condition. Well, the actions should be idempotent this is a special consideration taken. And, those actions which have not marked as complete or have failed in the DB, the master restarts them. We can see these persisted actions in Redo Logs.
- Based on the desired state
While there is a restart the master tries to make the cluster in the live state as per the desired state as the desired state of the cluster is persisted by the master.
Q24 : What is Ganglia is used for in Ambari?
A : It is one of the tools that is used in Ambari, it is mainly used for the following purpose:
1. Monitoring
2. Identifying trending patterns
3. Metrics collection in the clusters
4. It also supports detailed heatmaps
Q25 : What are the different life cycle commands in Ambari?
A : The Ambari has a defined life cycle commands and they are as follows:
1. Start
2. Stop
3. Status
4. Install
5. Configure
It is very flexible in terms of adding or removing or reconfiguring any of the services at any time.