Big Data is a new term used to identify the datasets that are large in size and complexity. In this paper provide the overview of the process for data mining for big data. We address the current issues and challenges in big data mining process compared to the traditional data mining, the. Big Data Mining is one of the most excited research challenges in coming years. One of the key issues raised by data mining technology is not a business or technological one, but a social one. Other issues are that of data integrity, Analytics Architecture, Evaluation, Visualization and Distributed mining are deliberated.
D. Feldman, M. Schmidt, and C. Sohler. Turning big data into tiny data: Constant-size coresets for k-means, pca and projective clustering. In SODA, 2013.
Agrawal, D., Bernstein, P., Bertino, E., et al.: Challenges and Opportunities With big data A Community White Paper Developed by Leading Researchers Across the United States (2012), http://cra.org/ccc/docs/ init/bigdatawhite paper.pdf.
U. Fayyad. Big Data Analytics: Applications and Opportunities in On-line Predictive Modeling. http:// big-data-mining.org/keynotes/#fayyad, 2012.
A. Bifet, G. Holmes, R. Kirkby, and B. Pfahringer. MOA: Massive Online Analysis http://moa. cms.waikato.ac.nz/. Journal of Machine Learning Research (JMLR), 2010.
B. Efron. Large-Scale Inference: Empirical Bayes Methods for Estimation, Testing, and Prediction. Institute of Mathematical Statistics Monographs. Cambridge University Press, 2010.
Berkovich, S., Liao, D.: On Clusterization of big data Streams. In: 3rd International Conference on Computing for Geospatial Research and Applications, article no. 26. ACM Press, New York (2012)
Beyer, M.A., Laney, D.: The Importance of ‘Big Data’: A Definition. Gartner (2012)
Madden, S.: From Databases to big data. IEEE Internet Computing 16(3), 4–6 (2012)
Shmueli, G., Patel, N.R., Bruce, P.C.: Data Mining for Business Intelligence: Concepts, Techniques, and Applications in Microsoft Office Excel with XLMiner, 2nd edn. Wiley & Sons, Hoboken (2010)
Ghoting, A., Kambadur, P., Pednault, E., Kannan, R.: NIMBLE: a Toolkit for the Implementation of Parallel Data Mining and Machine Learning Algorithms on MapReduce. In: 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Diego, California, USA, pp. 334–342 (2011).
B R Prakash, Dr. M. Hanumanthappa: Issues and Challenges in the Era of Big Data Mining International Journal of Emerging Trends & Technology in Computer Science Volume 3, Issue 4, July - August 2014 ISSN 2278-6856.
Low, Y., Bickson, D., Gonzalez, J., Guestrin, C., Kyrola, A., Hellerstein, J.M.: Distributed GraphLab: A Framework for Machine Learning and Data Mining in the Cloud. VLDB Endowment 5(8), 71–727 (2012)
Borkar, V.R., Carey, M.J., Li, C.: big data Platforms: What’s Next? ACM Crossroads 19(1), 44–49 (2012)
L. Backstrom, P. Boldi, M. Rosa, J. Ugander, and S. Vigna. Four Degrees of Separation. CoRR, abs/1111.4570, 2011.