Data science is the process of extracting insights or relevant information from raw data that are useful to an organization. It draws on techniques from many fields, including computer science, statistics, mathematics, visualization, and data engineering. Data are broadly either structured or unstructured. Unstructured data require special attention because they are raw in form, and extracting useful insights from such large volumes takes extra effort. Data are now available in massive amounts, as they are collected from nearly every source, be it online shopping, multimedia, applications, and so on.