We present the details of a large scale user profiling framework that we developed here on Apache Hadoop. We address the problem of extracting and maintaining a very large number of user profiles extracted from large scale data. In this work, a user profiles is often used to classify a given user into pre-defined user segments or to capture the online behavior of the user including the user’s private interests and preferences. A user profiles can be explicitly defined by the user himself. User Profiling is usually defined as the process of implicitly learning a user profiles from data associated with the user. The Data extracted in stored form of the xlsx, pdf, docx format in certain Data-marts or organization is also extracted to get user information and personalize the user’s behavior accordingly. Data sources for user profiling include among others the user’s browsing sessions or even other user profiles using collaborative filtering techniques.
 U. Cetintemel, M. J. Franklin, and C. L. Giles. Self-adaptive user profiles for large-scale data delivery. In ICDE, pages 622–633, 2000
 Y. Chen, D. Pavlov, and J. F. Canny. Large-scale behavioral targeting. In KDD ’09, New York, NY, USA, 2009. ACM.
 S. Gauch, M. Speretta, A. Chandramouli, and A. Micarelli. User profiles for personalized information access. In The Adaptive Web, volume 4321 of Lecture Notes in Computer Science. Berlin, Heidelberg 2007
 Yanagimoto and S. H. Omatu. User profile creation using genetic algorithm with kullbackleibler divergence. IEEJ Transactions on Electronics, Information and Systems,126:389–394, 2006.
 Michal Shmueli-Scheuer, Haggai Roitman, David Carmel, Yosi Mass, DavidKonopnicki
 IEEE,Extracting User Profiles from Large Scale Data, Michal Shmueli-Scheuer, Haggai Roitman, David Carmel, Yosi Mass, David,2009
 International Journal of Scientific & Engineering Research, Mapreduce Performance in Heterogeneous Environments: A Review, Salma Khalil, Sameh A.Salem, Salwa Nassar and Elsayed M.Saad, April -2013.
 International Journal of Scientific & Engineering Research, A SURVEY ON BIG DATA, Amegha.K, Sowmya.B, Apoorva M.P, July-2013
[Kunal Oswal,, Saloni Mapara, , Asmita Deshmukh,, Richa Runwal, , Bilkis Chandargi, (2014), Extraction of User Profile from Library Data Set Using HADOOP, International Journal of Innovative Research in Computer Science & Technology (IJIRCST), Vol-2, Issue-2, Page No-64-67], (ISSN 2347 - 5552). www.ijircst.org
Information Technology, University of Pune/ Trinity College of engineering/ KJ’s Institute, Pune,India, Mobile No: 9604979366, (e-mail: firstname.lastname@example.org).