By Alexey Grigorev

Key Features

  • An evaluate of recent info technology and computing device studying libraries to be had in Java
  • Coverage of a wide set of subject matters, going from the fundamentals of computing device studying to Deep studying and large facts frameworks.
  • Easy-to-follow illustrations and the operating instance of establishing a seek engine.

Book Description

Java is the preferred programming language, in accordance with the TIOBE index, and it's a ordinary selection for working construction platforms in lots of businesses, either within the startup global and between huge enterprises.

Not unusually, it's also a typical selection for growing information technology functions: it truly is speedy and has a very good set of information processing instruments, either integrated and exterior. what's extra, picking out Java for facts technological know-how helps you to simply combine recommendations with current software program, and produce information technological know-how into creation with much less effort.

This booklet will train you ways to create info technological know-how purposes with Java. First, we'll revise crucial issues while beginning a knowledge technology program, after which brush up the fundamentals of Java and computing device studying earlier than diving into extra complicated themes. we commence via going over the present libraries for information processing and libraries with desktop studying algorithms. After that, we conceal subject matters resembling category and regression, dimensionality aid and clustering, details retrieval and traditional language processing, and deep studying and large data.

Finally, we end the booklet by way of speaking in regards to the how you can set up the version and review it in construction settings.

What you are going to learn

  • Get an outstanding figuring out of the information processing toolbox to be had in Java
  • Explore the knowledge technological know-how environment to be had in Java
  • Find out tips to strategy diverse computing device studying issues of Java
  • Process unstructured info corresponding to average language textual content or images
  • Create your personal seek engine
  • Get state of the art functionality with XGBoost
  • Learn easy methods to construct deep neural networks with DeepLearning4j
  • Build purposes that scale and procedure quite a lot of data
  • Deploy information technology versions to construction and review their performance

About the Author

Alexey Grigorev is a talented facts scientist, laptop studying engineer, and software program developer with greater than 7 years experience.

He all started his occupation as a Java developer operating at a couple of huge and small businesses, yet after your time he switched to information technological know-how. immediately, Alexey works as an information scientist at Searchmetrics, the place, in his daily activity, he actively makes use of Java and Python for facts cleansing, information research, and modeling.

His parts of craftsmanship are desktop studying and textual content mining, yet he additionally enjoys engaged on a wide set of difficulties, that is why he frequently participates in information technological know-how competitions on structures comparable to kaggle.com.

You can connect to Alexey on LinkedIn at https://de.linkedin.com/in/agrigorev.

Table of Contents

  1. Data technology utilizing Java
  2. Data Processing Toolbox
  3. Exploratory info Analysis
  4. Supervised studying - class and Regression
  5. Unsupervised studying - Clustering and Dimensionality Reduction
  6. Working with textual content - ordinary Language Processing and knowledge Retrieval
  7. Extreme Gradient Boosting
  8. Deep studying with DeepLearning4J
  9. Scaling info Science
  10. Deploying information technology Models

Show description

Read or Download Mastering Java for Data Science PDF

Best data mining books

Robust Data Mining (SpringerBriefs in Optimization)

Information uncertainty is an idea heavily comparable with so much actual existence purposes that contain information assortment and interpretation. Examples are available in info bought with biomedical tools or different experimental recommendations. Integration of strong optimization within the latest facts mining strategies goal to create new algorithms resilient to mistakes and noise.

Data Mining Mobile Devices

With today’s shoppers spending extra time on their mobiles than on their desktops, new equipment of empirical stochastic modeling have emerged that may offer sellers with specific information regarding the goods, content material, and providers their clients hope. information Mining cellular units defines the gathering of machine-sensed environmental info concerning human social habit.

Information Security Analytics: Finding Security Insights, Patterns, and Anomalies in Big Data

Info safety Analytics delivers insights into the perform of analytics and, extra importantly, how one can make the most of analytic concepts to spot traits and outliers that will not be attainable to spot utilizing conventional defense research suggestions. details safety Analytics dispels the parable that analytics in the info safeguard area is restricted to simply safety incident and occasion administration structures and easy community research.

Big Data Analytics Using Multiple Criteria Decision-Making Models (Operations Research Series)

A number of standards choice Making (MCDM) is a subfield of Operations study, facing choice making difficulties. A decision-making challenge is characterised by way of the necessity to decide upon one or a couple of between a couple of choices. the sector of MCDM assumes targeted value during this period of massive information and enterprise Analytics.

Additional info for Mastering Java for Data Science

Example text

Download PDF sample

Rated 4.83 of 5 – based on 34 votes