请显示每个公司的平均值和最大值的回归循环,并报告系数:
lm(colmean$MSFT~colmax$MSFT)
lm(colmean$AAPL~colmax$AAPL)
lm(colmean$GOOGL~colmax$GOOGL)
Data:
> head(colmax)
MSFT AAPL GOOGL
1 21.23999 5.201410 97.810
2 21.38546 6.096268 105.430
3 20.64884 6.019837 94.405
> head(colmean)
MSFT AAPL GOOGL
1 21.11067 4.975767 94.04000
2 20.91273 5.663524 97.50684
3 20.05333 5.681336 90.57909
答案 0 :(得分:0)
您可以尝试import org.apache.spark.sql.types.StructType
val nested_fields = df.schema
.filter(c => c.name == "b2")
.flatMap(_.dataType.asInstanceOf[StructType].fields)
.map(_.name)
// nested_fields: Seq[String] = List(a1, a2)
,最终结果将位于列表中,其中每个元素都适合每列:
lapply
或fit <- lapply(names(colmax), function(x){
lm(colmean[[x]] ~ colmax[[x]])
})
:
mapply