Integrate Storm with Ambari in Ubuntu 14 - apache-kafka

I install Storm and Ambari UI in a Ubuntu machine.
But now I want to join the Storm with ambari UI. Is there any tutorial? anyone have tips?
Note: I have just installed on the virtual machine the Storm, Kafka and Ambari server (default).
I know that there is a VM NortonWorks with these pre-installed services, but the idea is to install on a virgin machine.
thanks :)

Are you trying to manage and monitor Storm through Ambari? If so then you must provision Storm through Ambari. You can do this by logging into the ambari UI then clicking the actions buttons and select 'Add Service'. Follow the service installation wizard to install Storm. During this process you will be able to configure Storm to your needs. Storm is available in HDP versions 2.1 and higher.

Related

Use kafka connect without docker

I just finished the udemy course on kafka connect,
the course is based on docker, but what should I do not want to use docker?
Kafka connect requires JVM. Although you can run it only with JRE, I recommend installing jdk (like openjdk). Download JAR from https://packages.confluent.io/archive/6.2/ (or version that you prefer). And run it as java process passing parameters file as a configuration.
You don't need Confluent Platform. Download Kafka from Apache website. It comes with all commands to run Kafka Connect. The only requirement is Java (version 11 is recommended, although 17 is the latest supported).
To install connectors, you can use confluent-hub without Confluent Platform

How to run kafka s3 sink connector in confluent 6.2.0

Have installed confluent 6.2.0 in my 3 kafka nodes and also installed confluentinc-kafka-connect-s3-10.0.1 in 3 nodes and modified the quickstart-s3.properties but not sure how to start that...
Same answer applies without Confluent Platform.
You'd run one of the included bin/connect-* scripts to start a Connect worker.
The quickstart file is only applicable for standalone mode.

How to get appropriate Kafka version when installing it from HDF 3.4 over HDP 3.1

I am building a Hortonworks cluster with both HDP and HDF to be installed. I first installed HDP and then installed/integrated with HDF on top of it.
Environment Details:
OS: Red Hat Enterprise Linux Server release 7.6 (Maipo)
Version: Ambari -2.7.3, HDP - 3.1, HDF -3.4.0
Basically HDP-3.1 has kafka 1.0.1 in the package and in HDF has kafka 2.1.0 packages are available and I need HDF version of Kafka to be available. Though I had installed Kafka from HDF, Ambari shows the kafka version of 1.0.1. After integration with HDF, it's not showing up the Kafka-2.1.0 in the Add service list.
I need to know, how can I get Kafka 2.1.0 installed in the cluster.
Also, Is it a possibility that version showed is 1.0.1, though Kafka 2.1.0 is installed.
Is it a possibility that version showed is 1.0.1, though Kafka 2.1.0 is installed
Doubtful. Ambari parses the packages that are installed on the machine to determine the versions
My suggestion would be to manually SSH to each machine and try to install Kafka from yum and see what versions are available.
If you appropriately setup the HDF YUM repos, then that version of Kafka should be available.
Alternatively, you could always install Kafka/Zookeeper externally and manage it outside of Ambari

Download Kafka or confluent platform in package distributed CDH 5.16

I have installed CDH 5.16 Express using packages in a RHEL server. I am trying to install Kafka now and i observed that it can be installed only if CDH is installed as parcels.
1) Is it possible to install Kafka or confluent platform" separately in the server and use it along withCDH` components.
2) Is there any other workaround to install Kafka using Cloudera Manager
In order use the CDK 4.0 (cloudera distribution of Kafka) with Cloudera 5.13, I was forced to install CDK 4.0 as a parcel.
I had a cloudera quickstart docker VM that I downloaded. It runs without Kerberos authentication. After starting the quickstart VM, I separately installed the quickstart Kafka from Apache kafka's website. This was required as the kafka packaged within cloudera was a older version. Since, this was non kerberos environment, the Kafka server upon startup started using the zookeeper that was running in quickstart VM. This way I achieved connection of Kafka with cloudera VM.
If you are new to CDH/CM then I suggest you first try and use the Kafka service that is bundled within Cloudera. Go to 'Add Service' within Cloudera drop-down and select kafka. Enabling this Kafka service will give you a set of brokers for kafka to run. Also, Kafka needs Zookeeper to run. Zookeeper comes by default in Cloudera. So, you would get a working cluster with kafka enabled in it. You can think of changing to the latest version of Kafka (using the approach mentioned above) once you are comfortable with inbuilt tools of CDH/CM.

How to Install KAFKA Inside Virtual Box Have Hortonworks

I installed Oracle Virtual Machine and inside that did Hortonworks
set up.
Now I am trying to install Kafka in it.
When I fetch file using wget it got installed.
How can I see in which location the file saved.
And how to call it from Virtual Box.
How can I see all dependencies has install which required for KAFKA like
Java, scala,zookeeper
Please help
Thanks
Not sure why you're using wget when Ambari should be installing components for you.
Hortonworks installs all libraries under /usr/hdp/current
There should be a Kafka folder there
However, it's recommended you use Ambari to configure those resources, and all Kafka CLI tools should be on your path already