By Alexey Grigorev
- An overview of modern data science and machine learning libraries available in Java
- Coverage of a broad set of topics, going from the basics of machine learning to deep learning and big data frameworks
- Easy-to-follow illustrations and the running example of building a search engine
Java is the most popular programming language, according to the TIOBE index, and it is a typical choice for running production systems in many companies, both in the startup world and among large enterprises.
Not surprisingly, it is also a common choice for creating data science applications: it is fast and has a great set of data processing tools, both built-in and external. What is more, choosing Java for data science lets you easily integrate solutions with existing software and bring data science into production with less effort.
This book will teach you how to create data science applications with Java. First, we will review the most important things to consider when starting a data science application, and then brush up on the basics of Java and machine learning before diving into more advanced topics. We start by going over the existing libraries for data processing and libraries with machine learning algorithms. After that, we cover topics such as classification and regression, dimensionality reduction and clustering, information retrieval and natural language processing, and deep learning and big data.
Finally, we finish the book by talking about the ways to deploy the model and evaluate it in production settings.
What you are going to learn
- Get a solid understanding of the data processing toolbox available in Java
- Explore the data science ecosystem available in Java
- Find out how to approach different machine learning problems with Java
- Process unstructured information such as natural language text or images
- Create your own search engine
- Get state-of-the-art performance with XGBoost
- Learn how to build deep neural networks with DeepLearning4j
- Build applications that scale and process large amounts of data
- Deploy data science models to production and evaluate their performance
About the Author
Alexey Grigorev is a skilled data scientist, machine learning engineer, and software developer with more than 7 years of experience.
He started his career as a Java developer working at a number of large and small companies, but after a while he switched to data science. Right now, Alexey works as a data scientist at Searchmetrics, where, in his day-to-day job, he actively uses Java and Python for data cleaning, data analysis, and modeling.
His areas of expertise are machine learning and text mining, but he also enjoys working on a broad set of problems, which is why he often participates in data science competitions on platforms such as kaggle.com.
You can connect with Alexey on LinkedIn at https://de.linkedin.com/in/agrigorev.
Table of Contents
- Data Science Using Java
- Data Processing Toolbox
- Exploratory Data Analysis
- Supervised Learning - Classification and Regression
- Unsupervised Learning - Clustering and Dimensionality Reduction
- Working with Text - Natural Language Processing and Information Retrieval
- Extreme Gradient Boosting
- Deep Learning with DeepLearning4J
- Scaling Data Science
- Deploying Data Science Models
Best data mining books
Data uncertainty is a concept closely related to most real-life applications that involve data collection and interpretation. Examples can be found in data acquired with biomedical instruments or other experimental techniques. Integrating robust optimization into existing data mining techniques aims to create new algorithms that are resilient to error and noise.
With today’s consumers spending more time on their mobiles than on their PCs, new methods of empirical stochastic modeling have emerged that can provide marketers with detailed information about the products, content, and services their customers desire. Data Mining Mobile Devices defines the collection of machine-sensed environmental data pertaining to human social behavior.
Information Security Analytics gives you insights into the practice of analytics and, more importantly, how you can use analytic techniques to identify trends and outliers that may not be possible to identify using traditional security analysis techniques. Information Security Analytics dispels the myth that analytics within the information security domain is limited to security incident and event management systems and basic network analysis.
Multiple Criteria Decision Making (MCDM) is a subfield of Operations Research that deals with decision-making problems. A decision-making problem is characterized by the need to choose one or a few among a number of alternatives. The field of MCDM assumes special importance in this era of Big Data and Business Analytics.
- Bioinformatics Database Systems
- Algorithms in Bioinformatics: 15th International Workshop, WABI 2015, Atlanta, GA, USA, September 10-12, 2015, Proceedings (Lecture Notes in Computer Science)
- Data Mining for Genomics and Proteomics: Analysis of Gene and Protein Expression Data (Wiley Series on Methods and Applications in Data Mining)
- Handbook of Educational Data Mining (Chapman & Hall/CRC Data Mining and Knowledge Discovery Series)
- Data Science with Java: Practical Methods for Scientists and Engineers
Additional info for Mastering Java for Data Science