DNS problem in BIG DATA

In Big data ecosystem all the nodes have hostname in a cluster, when we are trying to access ResoureManager, NodeManager, Application Master,Job History server, Spark UI, SparkHistroyServer we always change the hostname to IP address, each and every time we are doing it, especially when we trying to check job progress, metrics, job logs we are navigating from ResourceManager to ApllicationMaster/HistoryServer. Believe me its a headache to change it each and every time.

There is a solution to overcome this problem. Follow below simple steps 
  1. In Windows machine, open Notepad++ as Admin user
  2. Browse C:\Windows\System32\drivers\etc\hosts this file, edit file in Notepad++.
  3. Add ipaddress hostname to the file 
  4. Save that file, now you can able to access the website without need to change hostname to IP address. 
Refer below hosts file opened in Notepad++ with highlighted area those entry are added now to access

Comments

  1. "we always change the hostname to IP address" - why would you do this and what does it have to do with "big data"?

    ReplyDelete
    Replies
    1. Normally companies will use private ip to communicate within a cluster of nodes for big data hadoop ecosystem. so the privateip is mapped to hostname, not with the public ip.,
      which is why we need to map hostname to public ip.

      Delete

Post a Comment

Popular posts from this blog

Avro Secondary Sorting

Builder Design Pattern

Git Bundle