Trending questions in Data Analytics

+1 vote
1 answer

Difference between validation set and a test set

Validation set can be considered as a part ...READ MORE

Jul 24, 2018 in Data Analytics by CodingByHeart77
• 3,750 points
1,526 views
0 votes
1 answer

Join multiple strings in R

Joining strings in R is quite an ...READ MORE

Jul 19, 2018 in Data Analytics by DataKing99
• 8,250 points
1,736 views
+1 vote
1 answer

How good at SQL does a data scientist really need to be?

SQL is a standardized query language for requesting information ...READ MORE

Aug 9, 2018 in Data Analytics by Abhi
• 3,720 points
743 views
0 votes
1 answer

Logistic Regression Example

Logistic Regression often referred to as the ...READ MORE

Jul 19, 2018 in Data Analytics by CodingByHeart77
• 3,750 points
1,722 views
0 votes
1 answer

Can a data scientist do everything a data analyst can do?

Technically speaking, yes and no. Yes from ...READ MORE

Aug 9, 2018 in Data Analytics by Abhi
• 3,720 points
814 views
0 votes
1 answer

How to drop factor levels in a subsetted data frame?

You can use factor(ff) to drop levels ...READ MORE

Apr 17, 2018 in Data Analytics by kappa3010
• 2,090 points

edited Apr 17, 2018 by kappa3010 5,676 views
0 votes
1 answer

How can I calculate mean per group in a data.frame?

You can use aggregate function for calculating ...READ MORE

May 24, 2018 in Data Analytics by zombie
• 3,790 points
4,069 views
0 votes
1 answer

What is data science?

Data Science is the practice of: Asking questions (formulating hypothesis), ...READ MORE

Aug 3, 2018 in Data Analytics by Abhi
• 3,720 points
1,008 views
0 votes
1 answer

On a given dataset would time taken to train n - random forest be equal to time taken to train n X (Decision tree)

No, the time to train the random ...READ MORE

Jul 24, 2018 in Data Analytics by Abhi
• 3,720 points
1,390 views
0 votes
1 answer

Steps to evaluate Linear Regression in R

 These are sequential steps which need to ...READ MORE

Jul 25, 2018 in Data Analytics by CodingByHeart77
• 3,750 points
1,352 views
0 votes
1 answer

Would time taken to train n-random forest be equal to n x (decision trees)

No, the time to train the random ...READ MORE

Aug 7, 2018 in Data Analytics by Abhi
• 3,720 points
1,076 views
0 votes
1 answer

How to save a DataFrame in R??

Suppose your DataFrame is named as df: write.csv(df,file="exmp.csv") Then ...READ MORE

Apr 20, 2018 in Data Analytics by DeepCoder786
• 1,720 points
5,399 views
0 votes
1 answer

Load CSV File in R

Loading a .csv file in R is ...READ MORE

Jul 28, 2018 in Data Analytics by Sahiti
• 6,370 points
1,121 views
0 votes
1 answer

How to build a Scatter-plot using plotly?

With the help of “plotly” we can ...READ MORE

Jul 6, 2018 in Data Analytics by CodingByHeart77
• 3,750 points
2,023 views
0 votes
1 answer

How to export a graph to .eps file with R?

The easiest way that I could suggest ...READ MORE

Apr 27, 2018 in Data Analytics by Sahiti
• 6,370 points
5,023 views
0 votes
2 answers

PCA model in R

Principal component analysis (PCA) is routinely employed on ...READ MORE

Jul 19, 2018 in Data Analytics by zombie
• 3,790 points
1,411 views
0 votes
1 answer

Reshape data from long to wide format in R

Use reshape function: reshape(dat1, idvar = "name", timevar = ...READ MORE

Jun 14, 2018 in Data Analytics by CodingByHeart77
• 3,750 points
2,856 views
0 votes
1 answer

How do I become a data scientist step by step?

I am assuming that you are a ...READ MORE

Jul 26, 2018 in Data Analytics by Abhi
• 3,720 points
882 views
0 votes
1 answer

Plot two variables as lines on the same graph using ggplot

If you have small number of variables, ...READ MORE

Apr 17, 2018 in Data Analytics by kappa3010
• 2,090 points
5,199 views
0 votes
1 answer

Components in grammar of graphics

Broadly speaking these are different components in grammar of ...READ MORE

Jul 24, 2018 in Data Analytics by Sahiti
• 6,370 points
916 views
0 votes
1 answer

List packages are used for data mining in R?

You can refer to the following packages ...READ MORE

Jul 3, 2018 in Data Analytics by DataKing99
• 8,250 points
1,789 views
0 votes
1 answer

Finding frequency of observations in R

You can use the "dplyr" package to ...READ MORE

Mar 26, 2018 in Data Analytics by Bharani
• 4,660 points
6,049 views
0 votes
1 answer

How to calculate group mean and assign it to new data in R

You can use something like this: df$grp.mean.values <- ...READ MORE

Jun 27, 2018 in Data Analytics by Sahiti
• 6,370 points
2,036 views
0 votes
2 answers

What is difference between Distributed search head and Search head cluster?

 A distributed environment describes the separation of ...READ MORE

Dec 4, 2018 in Data Analytics by Ali
• 11,360 points
2,690 views
0 votes
1 answer

Data Imputation Packages

These are some packages in R which ...READ MORE

Jul 28, 2018 in Data Analytics by Sahiti
• 6,370 points
718 views
0 votes
1 answer

Remove messages from R markdown report

All you have to do is, set ...READ MORE

May 14, 2018 in Data Analytics by Bharani
• 4,660 points
3,915 views
0 votes
1 answer

Catch integer(0) in R

R's way of printing a zero length ...READ MORE

Apr 27, 2018 in Data Analytics by kappa3010
• 2,090 points
4,633 views
0 votes
1 answer

Statistical Power of Sensitivity

Sensitivity is commonly used to validate the ...READ MORE

Jul 13, 2018 in Data Analytics by CodingByHeart77
• 3,750 points
1,299 views
+1 vote
1 answer

How to handle Nominal Data?

Nominal data is basically data which can ...READ MORE

Jul 24, 2018 in Data Analytics by Abhi
• 3,720 points
768 views
0 votes
1 answer

Overfitting vs Underfitting

In statistics and machine learning, one of ...READ MORE

Jul 11, 2018 in Data Analytics by CodingByHeart77
• 3,750 points
1,314 views
0 votes
1 answer

Selection Bias

Selection bias is the bias introduced by the ...READ MORE

Jul 11, 2018 in Data Analytics by CodingByHeart77
• 3,750 points
1,267 views
0 votes
1 answer

How to remove all variables except functions in R?

One line that removes all objects except for functions: rm(list ...READ MORE

Apr 25, 2018 in Data Analytics by DeepCoder786
• 1,720 points
4,580 views
0 votes
2 answers

Apply function in R

apply Description: “Returns a vector or array or ...READ MORE

Jul 10, 2018 in Data Analytics by zombie
• 3,790 points
1,238 views
0 votes
1 answer

Confusion matrix in R

A confusion matrix can be used to ...READ MORE

Jul 5, 2018 in Data Analytics by DataKing99
• 8,250 points
1,430 views
0 votes
2 answers

Clustering in R

Clustering is a type of unsupervised learning ...READ MORE

Jul 9, 2018 in Data Analytics by zombie
• 3,790 points
1,206 views
0 votes
1 answer

How to create a new R6 Class in R?

You have to first create an object ...READ MORE

Jul 5, 2018 in Data Analytics by DataKing99
• 8,250 points
1,413 views
0 votes
1 answer

How to treat missing values during analysis?

The extent of the missing values is ...READ MORE

Jul 12, 2018 in Data Analytics by Sahiti
• 6,370 points
1,082 views
0 votes
1 answer

Cross Validation

Cross-validation is a model validation technique for ...READ MORE

Jul 17, 2018 in Data Analytics by CodingByHeart77
• 3,750 points
891 views
0 votes
1 answer

How to find out cluster center mean of DBSCAN in R?

Just index back into the original data ...READ MORE

Jun 27, 2018 in Data Analytics by Sahiti
• 6,370 points
1,649 views
0 votes
1 answer

White noise model in R

The white noise (WN) model is a ...READ MORE

Jul 10, 2018 in Data Analytics by Sahiti
• 6,370 points
1,084 views
0 votes
1 answer

Random Walk model in R

A random walk is a simple example ...READ MORE

Jul 10, 2018 in Data Analytics by Sahiti
• 6,370 points
1,082 views
0 votes
1 answer

Multiple plots onto a single page in R

Plotting multiple plots onto a single page ...READ MORE

Jul 17, 2018 in Data Analytics by Sahiti
• 6,370 points
705 views
0 votes
1 answer

Treat outliers in Dataset

Outlier values can be identified by using ...READ MORE

Jul 12, 2018 in Data Analytics by Sahiti
• 6,370 points
915 views
0 votes
1 answer

Drop unused levels from a data frame in R

You can use this command droplevels() y <- ...READ MORE

Jun 14, 2018 in Data Analytics by DataKing99
• 8,250 points
2,098 views
0 votes
1 answer

Use different distance formula other than euclidean distance in k means

K-means is based on variance minimization. The sum-of-variance formula ...READ MORE

Jun 21, 2018 in Data Analytics by Sahiti
• 6,370 points
1,782 views
0 votes
1 answer

How to use dplyr functions such as filter() inside nested data frames with map()

You can use map() call as follows:  map(full, ...READ MORE

Apr 6, 2018 in Data Analytics by Sahiti
• 6,370 points
4,991 views
0 votes
1 answer

Create vector matrix of movie ratings using R project?

Why do'nt you try the dcast function, in the reshape2 package. d ...READ MORE

Jun 30, 2018 in Data Analytics by anonymous
1,313 views
0 votes
1 answer

How to convert a sentence to word table in R?

Try the following code: sentence <- c("case sweden", ...READ MORE

Jun 21, 2018 in Data Analytics by Sahiti
• 6,370 points
1,659 views
0 votes
1 answer

TwitteR package download of package ‘rjson’ fails in R

If you don't want to upgrade your ...READ MORE

Jun 27, 2018 in Data Analytics by CodingByHeart77
• 3,750 points
1,419 views
0 votes
1 answer

How to sample random rows in dataframe?

Create data frame and then implement as ...READ MORE

Jul 3, 2018 in Data Analytics by Sahiti
• 6,370 points
1,103 views