R / mgcv : te () 및 ti () 텐서 제품이 다른 표면을 생성하는 이유는 무엇입니까?

mgcv에 대한 패키지는 R텐서 제품의 상호 작용을 피팅에 대한 두 가지 기능이 있습니다 : te()와 ti(). 나는 둘 사이의 기본 노동 분열을 이해한다 (비선형 상호 작용에 적합하고이 상호 작용을 주요 효과와 상호 작용으로 분해). 내가 이해할 수없는 것은 왜 te(x1, x2)와 ti(x1) + ti(x2) + ti(x1, x2)(약간) 다른 결과가 발생할 수 있습니다.

MWE (에서 적응 ?ti) :

require(mgcv)
test1 <- function(x,z,sx=0.3,sz=0.4) { 
  x <- x*20
 (pi**sx*sz)*(1.2*exp(-(x-0.2)^2/sx^2-(z-0.3)^2/sz^2)+
             0.8*exp(-(x-0.7)^2/sx^2-(z-0.8)^2/sz^2))
}
n <- 500

x <- runif(n)/20;z <- runif(n);
xs <- seq(0,1,length=30)/20;zs <- seq(0,1,length=30)
pr <- data.frame(x=rep(xs,30),z=rep(zs,rep(30,30)))
truth <- matrix(test1(pr$x,pr$z),30,30)
f <- test1(x,z)
y <- f + rnorm(n)*0.2

par(mfrow = c(2,2))

# Model with te()
b2 <- gam(y~te(x,z))
vis.gam(b2, plot.type = "contour", color = "terrain", main = "tensor product")

# Model with ti(a) + ti(b) + ti(a,b)
b3 <- gam(y~ ti(x) + ti(z) + ti(x,z))
vis.gam(b3, plot.type = "contour", color = "terrain", main = "tensor anova")

# Scatterplot of prediction b2/b3
plot(predict(b2), predict(b3))

이 예제에서 차이점은 그다지 크지 않지만 차이점이 왜 있는지 궁금합니다.

세션 정보 :

 > devtools::session_info("mgcv")
 Session info
 -----------------------------------------------------------------------------------
 setting  value                       
 version  R version 3.3.1 (2016-06-21)
 system   x86_64, linux-gnu           
 ui       RStudio (0.99.491)          
 language en_US                       
 collate  en_US.UTF-8                 
 tz       <NA>                        
 date     2016-09-13                  

 Packages      ---------------------------------------------------------------------------------------
 package * version date       source        
 lattice   0.20-33 2015-07-14 CRAN (R 3.2.1)
 Matrix    1.2-6   2016-05-02 CRAN (R 3.3.0)
 mgcv    * 1.8-12  2016-03-03 CRAN (R 3.2.3)
 nlme    * 3.1-128 2016-05-10 CRAN (R 3.3.1)

r gam mgcv conditional-probability mixed-model references bayesian estimation conditional-probability machine-learning optimization gradient-descent r hypothesis-testing wilcoxon-mann-whitney time-series bayesian inference change-point time-series anova repeated-measures statistical-significance bayesian contingency-tables regression prediction quantiles classification auc k-means scikit-learn regression spatial circular-statistics t-test effect-size cohens-d r cross-validation feature-selection caret machine-learning modeling python optimization frequentist correlation sample-size normalization group-differences heteroscedasticity independence generalized-least-squares lme4-nlme references mcmc metropolis-hastings optimization r logistic feature-selection separation clustering k-means normal-distribution gaussian-mixture kullback-leibler java spark-mllib data-visualization categorical-data barplot hypothesis-testing statistical-significance chi-squared type-i-and-ii-errors pca scikit-learn conditional-expectation statistical-significance meta-analysis intuition r time-series multivariate-analysis garch machine-learning classification data-mining missing-data cart regression cross-validation matrix-decomposition categorical-data repeated-measures chi-squared assumptions contingency-tables prediction binary-data trend test-for-trend matrix-inverse anova categorical-data regression-coefficients standard-error r distributions exponential interarrival-time copula log-likelihood time-series forecasting prediction-interval mean standard-error meta-analysis meta-regression network-meta-analysis systematic-review normal-distribution multiple-regression generalized-linear-model poisson-distribution poisson-regression r sas cohens-kappa

— jvh_ch
소스

진심으로 사람들!? 분명히 mgcv 특정 일에 구현 동안 (내가 모르고있어 모든 문제와 답은 명확 통계입니다이 ANOVA와 같은 이변 부드럽게의 분해가 있습니다 GAMS 다른 상용 소프트웨어); "상호 작용"구성 요소에서 한계 항을 분해 할 때 발생하는 추가 페널티 매트릭스로 인해 적합하지 않은 모델이 후드에서 동일하지 않습니다. 이것은 mgcv에만 국한되지 않습니다.

— 개빈 심슨

@GavinSimpson 나는 메타 에 관한 주제에 관한 질문을 제기 했다. 또는 그렇지 않다면

— Silverfish

이것들은 표면적으로 동일한 모델이지만 실제로 피팅 할 때 약간의 미묘한 차이가 있습니다. 한 가지 중요한 차이점은 ti()항이 있는 모형이 모형과 비교하여 더 평활도 매개 변수를 추정한다는 것입니다 te().

> b2$sp
te(x,z)1 te(x,z)2 
3.479997 5.884272 
> b3$sp
    ti(x)     ti(z)  ti(x,z)1  ti(x,z)2 
 8.168742 60.456559  2.370604  2.761823

이것은 두 모델과 관련된 더 많은 페널티 매트릭스가 있기 때문입니다. 에서 ti()모델 우리는 단지 두에 비해 "용어"당이 te()모델 한계 부드러운 하나씩.

또는ti() 원하는지 결정하는 데 사용되는 모델이 있습니다 . 용어를 사용하면이 모델을 비교할 수 없으므로를 사용 합니다. 가 필요한지 확인한 후에는 모델이 필요한 경우 또는 가 필요하지 않은 경우 각 한계 효과에 대해 별도로 모델을 다시 맞출 수 있습니다 . $\hat{y} = \beta_0 + s(x, y)$ $\hat{y} = \beta_0 + s(x) + s(y)$ te()ti() $s(x,y)$ te()s() $s(x,y)$

사용 피팅에 의해 서로 가까이에 다소 모델을 얻을 수 있습니다 method = "ML"(또는 "REML", 그러나 당신이 "고정"효과를 비교해서는 안 "REML"모든 조건이 완전히 기본적으로 그렇지 않은하는 범하지 않는,하지만 말 것 와 함께 select = TRUE).

— 개빈 심슨
소스