Infrastructure at your Service

All posts by Mehdi Bada

Mehdi Bada
Consultant


Mehdi Bada

DockerCon18 Barcelona – Day 3

By | DevOps, Docker | No Comments

Wednesday was the last day of the DockerCon18 Europe. Like the previous day, we started the journey by a keynote of 2 hours, more oriented “Docker Community”, the core message of the keynote, the community is one of the pillars of open source technologies and Docker Inc want to push more and more the community aspect. The community is growing very fast and competitive. They take the opportunity to award the community leader of the year and a…

Read More
Mehdi Bada

DockerCon18 Barcelona – Day 2

By | DevOps, Docker | No Comments

Tuesday was the second day in Barcelona for the DockerCon18. We attend the first general session in the morning. It was a mix of presentations, live demos and the participation of Docker big customer in EMEA such as Société Général in France, who present us the impact of Docker in their daily business. The main message of the first part of the keynote was: “How Docker can help you to make the digital transformation of your business”….

Read More
Mehdi Bada

DockerCon2018 Barcelona – Day 1

By | DevOps, Docker | One Comment

As a football fan, traveling to Barcelona without watching a football game is inconceivable, so I started my travel by attending the game against Villareal in Camp Nou    DockerCon Day 1 Today, with David Barbarin, was our first day at DockerCon2018 in Barcelona. The conference is located in a huge conference center and split between types of sessions including workshops, keynotes, hands-on labs, and hallway track. The first day was only focused on workshops, hands-on labs and hallway track…

Read More
Mehdi Bada

MongoDB OPS Manager

By | Big Data, Database Administration & Monitoring, Database management, NoSQL | No Comments

MongoDB OPS Manager (MMS) is a tool for administering and managing MongoDB deployments, particularly large clusters. MongoDB Inc. qualified it as “the best way to manage your MongoDB data center“. OPS Manager also allows you to deploy a complete MongoDB cluster in multiple nodes and several topologies.  As you know, at dbi services, the MongoDB installation is based on our best practices, especially the MFA (MongoDB Flexible Architecture), more information here. Is OPS Manager compatible with…

Read More
Mehdi Bada

PII search using HCI

By | Database Administration & Monitoring | No Comments

In a previous blog, we described how to install Hitachi Content Intelligence the solution of Hitachi Ventara for data indexing and search. In this blog post, we will see how we can use Hitachi Content Intelligence to perform the basic search on personal information (PII). Data Connections HCI allows you to connect to multiple data source using default data connectors. The first step is to create a data connection. By default, multiples data connectors are…

Read More
Mehdi Bada

Hitachi Content Intelligence deployment

By | Big Data | No Comments

Hitachi Content Intelligence (HCI) is a search and data processing solution. It allows the extraction, classification, enrichment, and categorization of data, regardless of where the data lives or what format it’s in. Content Intelligence provides tools at large scale across multiple repositories. These tools are useful for identifying, blending, normalizing, querying, and indexing data for search, discovery, and reporting purposes. Architecture HCI has components called data connections that it uses to access the places where…

Read More
Mehdi Bada

Creating and Using a Parcel Repository for Cloudera Manager

By | Big Data | No Comments

This blog post describes how to create a hosted Cloudera repository and use it in your Cloudera Manager deployment. The first step is to install a web server, which will host RPM packages and repodata. The common way, is to use an Apache web server. Installing Apache HTTPD service [cdhtest@edge ]$ sudo yum install httpd -y   Starting Apache HTTPD service [cdhtest@edge ]$ sudo systemctl start httpd Verify that the service has been started properly….

Read More
Mehdi Bada

Create an HDFS user’s home directory

By | Big Data | No Comments

Let’s assume we need to create an HDFS home directory for a user named “dbitest”. We need first to verify if the user exists on the local filesystem. It’s important to understand that HDFS is mapping users from the local filesystem. [cdhtest@master ~]$ cat /etc/passwd | grep dbitest  Create a user on the local file system When the user is not created, we can easily create one with it associated group. [cdhtest@master ~]$ sudo groupadd…

Read More
Mehdi Bada

Deploy a Cloudera cluster with Terraform and Ansible in Azure – part 3

By | Big Data, Cloud | No Comments

After the deployment step with Terraform and the configuration/installation with Ansible, we will continue the installation of our Cloudera cluster with Cloudera Manager. By following the below steps you will see how to install CDH on our hosts using Cloudera Manager. Connection First, Login to Cloudera manager URL. When you connect to C.M for the first time, you need to accept the Cloudera Terms and Conditions. Then choose your desired edition of Cloudera. For this…

Read More
Mehdi Bada

Deploy a Cloudera cluster with Terraform and Ansible in Azure – part 2

By | Big Data, Cloud, Development & Performance | No Comments

In this part of the blog posts series, we will show how ansible helps us to configure our cluster and install all pre-requisite needed for Cloudera Manager. Ansible is one of the most important automation tools currently. Ansible will help us to configure all nodes for a manual installation using Cloudera Manager. Our playbook will contain the following roles: cm_repo: add the same C.M repo into all nodes. os_config: Adjust all OS parameter for installing…

Read More