## Microsoft DP-100 – Designing and Implementing a Data Science Solution on Azure – free questions.

Microsoft Certified Azure Data Scientist Associate, the DP-100 exam measures your ability to accomplish technical tasks like: manage Azure resources for machine learning run experiments and train models deploy and operationalize machine learning solutions implement responsible machine learning Example Questions...

Read More## Trading with Python Intro – Data Import

Traditionally, there have been two general ways of analyzing market data: fundamental analysis – focused on underlying fundamental data technical analysis – focused on charts and price movements In recent years, computer science and mathematics revolutionized trading, it has become...

Read More## Data Scientist Interview Questions – Explain what precision and recall are?

Data Scientist Interview Questions – Explain what precision and recall are. After the predictive model has been finished, the most important question is: How good is it? Does it predict well? Evaluating the model is one of the most...

Read More## How would you validate-test a predictive model?

How would you validate-test a predictive model? Why evaluate/test model at all? Evaluating the performance of a model is one of the most important stages in predictive modeling, it indicates how successful model has been for the dataset. It enables...

Read More## Why would you use Regularization and what it is?

Why would you use Regularization and what it is? In Machine Learning, very often the task is to fit a model to a set of training data and use the fitted model to make predictions or classify new (out of...

Read More## Introduction to TensorFlow

Introduction to TensorFlow. What is TensorFlow? The shortest definition would be, TensorFlow is a general-purpose library for graph-based computation. But there is a variety of other ways to define TensorFlow, for example, Rodolfo Bonnin in his book – Building Machine...

Read More## Where to learn TensorFlow for Free?

Below a list of free resources to learn TensorFlow: TensorFlow website: www.tensorflow.org Udacity free course: www.udacity.com Google Cloud Platform: cloud.google.com Coursera free course: www.coursera.org Machine Learning with TensorFlow by Nishant Shukla : www.tensorflowbook.com ‘First Contact With TensorFlow’ by Prof. JORDI TORRES: jorditorres.org or you...

Read More## Tensor Flow Cheat Sheet.

TensorFlow Quick Reference Table – Cheat Sheet. TensorFlow is very popular deep learning library, with its complexity can be overwhelming especially for new users. Here is a short summary of often used functions, if you want to download it in...

Read More## Popular Pandas snippets used in data analysis.

Popular Pandas snippets used in data analysis. Pandas is very popular Python library for data analysis, manipulation, and visualization, I would like to share my personal view on the list of most often used functions/snippets for data analysis. 1.Import Pandas...

Read More## Numerai – deep learning example code.

In a previous post on Numerai, I have described very basic code to get into a world of machine learning competitions. This one will be a continuation, so if you haven’t read it I recommend to do it- here. In...

Read More## Intro to machine learning competitions with ‘Numerai’ – example code.

In this post, I want to share, how simple it is to start competing in machine learning tournaments – Numerai. I will go step by step, line by line explaining what is doing what and why it is required. Numerai...

Read More## Intro to Machine Learning

What is a definition of Machine Learning? Machine Learning subfield of science that provides computers with the ability to learn without being explicitly programmed. The goal of Machine Learning is to develop learning algorithms that do the learning automatically without...

Read More## What is Hadoop YARN?

Hadoop YARN is the architectural center of Hadoop that allows multiple data processing engines such as interactive SQL, real-time streaming,...

Read More## What is Hadoop Flume?

Hadoop Flume was created in the course of incubator Apache project to allow you to flow data from a source...

Read More## What is Apache Kafka?

Apache Kafka is an open-source stream processing platform developed by the Apache Software Foundation written in Scala and Java. The...

Read More## What is Hadoop Zookeeper?

Hadoop Zookeeper is an open source Apache™ project that provides a centralized infrastructure and services that enable synchronization across a...

Read More## What is Hadoop Hbase?

Hadoop Hbase is a column-oriented database management system that runs on top of HDFS. It is well suited for sparse...

Read More## What is Hadoop Sqoop?

Hadoop Sqoop efficiently transfers bulk data between Apache Hadoop and structured datastores such as relational databases. Sqoop helps offload certain...

Read More## What is Hadoop Hive?

Hadoop Hive is a runtime Hadoop support structure that allows anyone who is already fluent with SQL (which is commonplace...

Read More## What is Hadoop Pig?

Hadoop Pig was initially developed at Yahoo to allow people using Hadoop to focus more on analyzing large datasets and...

Read More## What is Z-Score or Standard Score?

Z-Score or Standard Score in statistics is the signed number of standard deviations by which the value of an observation...

Read More## What is Unsupervised Learning?

Unsupervised Learning is a type of machine learning algorithm used to draw inferences from datasets consisting of input data without...

Read More## What is Type II Error?

Type II Error in statistical hypothesis testing is incorrectly retaining a false null hypothesis (a “false negative”). A type II...

Read More## What is Type I Error?

Type I Error in statistical hypothesis testing is the incorrect rejection of a true null hypothesis (a false positive). More...

Read More## What is True Positive Rate (Sensitivity)?

True Positive Rate (Sensitivity) is a statistical measure which measures the proportion of positives that are correctly identified as such...

Read More## What is True Negative Rate (Specificity)?

True Negative Rate (Specificity) is a statistical measure which measures the proportion of negatives that are correctly identified as such...

Read More## What is Three Sigma Rule?

Three Sigma Rule in the empirical sciences express a conventional heuristic that “nearly all” values are taken to lie within...

Read More## What is Support Vector Machines (SVM)?

Support Vector Machines (SVM) is a discriminative classifier formally defined by a separating hyperplane. In other words, given labeled training...

Read More## What is Supervised Learning?

Supervised Learning is the machine learning task of inferring a function from labeled training data. The training data consist of...

Read More## What is Statistical Significance?

Statistical Significance in statistical hypothesis testing is attained whenever the observed p-value of a test statistic is less than the...

Read More## What is Statistical Power?

Statistical Power of any test of statistical significance is defined as the probability that it will reject a false null...

Read More## What is Sentiment Analysis?

Sentiment Analysis refers to the use of natural language processing, text analysis, computational linguistics, and biometrics to systematically identify, extract,...

Read More## What is Semi-Supervised Learning?

Semi-Supervised Learning is a class of supervised learning tasks that also make use of unlabeled data for training – typically...

Read More