我正在尝试复制此代码
https://rstudio-pubs-static.s3.amazonaws.com/298981_faf93037ea1d4dd6b99b4f7764fd87bd.html
数据在这里
https://www.kaggle.com/c/nyc-taxi-trip-duration/data
我已经尝试过此解决方案
sample_data <- train_kaggle[sample(1:dim(train_kaggle)[1],100000),]
ggplot(mapping <-
aes(x=sample_data$pickup_longitude, y=sample_data$pickup_latitude))+
geom_point(alpha=.2,cex=0.0001,aes(colour=sample_data$pickupclus))+
scale_x_continuous(limits = c(-74.02,-73.85))+
scale_y_continuous(limits = c(40.7,40.85))+
ggtitle('Pick up locations clustered')+
guides(fill=F)+
xlab('Longitude')+ylab('Latitude')+ggtitle('Plot of pick up locations
with
clustering')+
theme(legend.position = 'none')