Local Linear Regression Estimator on the Boundary Correction in Nonparametric Regression Estimation
- DOI
- 10.2991/jsta.d.201016.001
- Keywords
- Kernel estimators; Nonparametric regression estimation; Local linear regression; Bias; Variance; Asymptotic mean integrated square error (AMISE)
- Abstract
The precision and accuracy of an estimator determine whether its estimated values can be relied upon, a concern central to most if not all statisticians. To this end, the biases of estimates are routinely checked and eliminated, or at least minimized. Even so, finding a model that fits the data well can be a challenge. There are many situations in which parametric estimation is disadvantageous because of possible misspecification of the model. Under such circumstances, researchers often let the data suggest a model for themselves, a technique that has become popular in recent years known as nonparametric regression estimation. Kernel estimators are commonly used in this technique. This paper examines the boundary bias of the well-known Nadaraya–Watson estimator and of the local linear regression estimator. A global error criterion, the asymptotic mean integrated square error (AMISE), is computed from simulated data at the empirical stage to assess the performance of the two estimators in regression estimation. This study shows that the local linear regression estimator performs markedly better than the standard Nadaraya–Watson estimator.
- Copyright
- © 2020 The Authors. Published by Atlantis Press B.V.
- Open Access
- This is an open access article distributed under the CC BY-NC 4.0 license (http://creativecommons.org/licenses/by-nc/4.0/).
1. INTRODUCTION
In nonparametric estimation, one important aspect is the smoothing that makes the interpretation of data possible. Smoothing, in essence, consists of creating an approximating function that attempts to capture only the important patterns in the data, while filtering out noise and ignoring data structures deemed irrelevant [1]. This nonparametric technique applies both to density and to regression estimation. Other common techniques in nonparametric regression estimation, the focus of this paper, are spline regression, orthogonal series, and kernel regression. This paper explores kernel regression estimation under the larger framework of model-based estimation. One motivating piece of research is due to Dorfman [2], who used the Nadaraya–Watson estimator, also known as the local constant estimator, to construct a finite population estimator. Although the Nadaraya–Watson estimator suffers from a boundary drawback, the resulting estimator of the population total was still better than the design-based Horvitz–Thompson estimator [3]. Many researchers have since carried out studies incorporating the Nadaraya–Watson technique. A few examples include Hall and Presnell [4], Cai [5], and Salha and Ahmed [6], who proposed weighted Nadaraya–Watson estimators for use in conditional distribution estimation and found the weighted version to be an improvement. In particular, Cai [5] claimed that the Nadaraya–Watson estimator is easier to implement than the local linear one. Herbert et al. [7] suggested using the jackknife technique to improve the Nadaraya–Watson estimator of the finite population total. It is, however, generally known in the literature that the standard Nadaraya–Watson estimator suffers from boundary bias, and as a result the prime focus of this study is boundary correction using the local linear estimator.
This paper is organized as follows: Section 2 gives a brief review of the literature on nonparametric regression and kernel regression estimation. Section 3 derives the bias and variance of the Nadaraya–Watson and local linear regression estimators; the boundary correction is shown in this section as well. Section 4 presents an empirical analysis using artificially simulated datasets, and Section 5 gives the discussion of results and the conclusion.
2. LITERATURE REVIEW
This section reviews the literature on nonparametric regression estimation, including the literature on kernel regression estimators.
2.1. Nonparametric Regression Estimation
Nonparametric regression estimation has been carried out in many studies. Dorfman [2] compared the famous design-based Horvitz–Thompson estimator with a nonparametric regression estimator built on the Nadaraya–Watson estimator [8,9]. He found that the kernel-based nonparametric regression estimator better reflects the structure of the data and hence yields greater efficiency. This regression estimator, however, suffered from the so-called boundary bias, besides facing bandwidth selection challenges. Breidt and Opsomer [10] carried out a similar study on nonparametric regression estimation of a finite population under two-stage sampling. Their study also reveals that nonparametric regression using the local polynomial technique dominated the Horvitz–Thompson estimator and greatly improved on the Nadaraya–Watson estimator. The results of Breidt et al. [11] likewise show that nonparametric regression estimation is superior to standard parametric estimators when the regression function of the model is incorrectly specified, while being nearly as efficient when the parametric specification is correct.
2.2. AMISE in Kernel Regression Estimators
The key properties a statistician checks for a given estimator are its variance and bias, which together measure its precision and accuracy. At an arbitrary fixed point, a basic measure of accuracy that takes both into account is the mean square error (MSE); see Tsybakov [12], Härdle [13], Takezawa [14], and Härdle et al. [15]. In nonparametric regression estimation, one is interested in the cumulative bias and variance over the entire regression line. This global measure, the mean integrated square error (MISE), is obtained by integrating the variance and the squared bias of the estimator over the whole line [16]. A Taylor expansion is normally used to obtain the asymptotic mean integrated square error (AMISE), and an optimal bandwidth can then be derived from the AMISE using differential calculus. Researchers who have carried out studies using these measures include Manzoor et al. [17]. From the asymptotic properties one can also deduce the speed of convergence of the estimators and hence the price paid for a given choice. Following this literature, Section 3 uses these measures at the analysis stage to compare the local linear estimator against the standard Nadaraya–Watson estimator.
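For concreteness, the pointwise and global criteria referred to above can be written as follows (standard definitions, stated here for reference):

```latex
\mathrm{MSE}\{\hat m(x)\}
  = \mathbb{E}\big[\{\hat m(x) - m(x)\}^2\big]
  = \mathrm{Bias}^2\{\hat m(x)\} + \mathrm{Var}\{\hat m(x)\},
\qquad
\mathrm{MISE}(h) = \int \mathrm{MSE}\{\hat m(x)\}\,dx .
```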
3. METHODOLOGY
This section gives the various techniques used in this study to perform the regression estimation. More specifically, the properties of each of the estimators are given.
3.1. Properties of the Kernel Regression Estimators
A model-based nonparametric model is conventionally of the form

$$Y_i = m(X_i) + e_i, \quad i = 1, 2, \ldots, n, \tag{1}$$

where

- $Y_i$ is the variable of interest,
- $X_i$ is the auxiliary variable,
- $m$ is an unknown function to be determined using sample data,
- $e_i$ is the error term, assumed to be independent and identically distributed with mean zero and constant variance $\sigma^2$.

The pairs $(X_i, Y_i)$ are bivariate random variables in the sample space.
The idea of nonparametric regression has gained prominence over the past couple of decades. Recent advances in technology and computing have enabled researchers to handle the massive computation this approach entails. This section gives a brief derivation of the Nadaraya–Watson estimator.
3.2. Review of the Nadaraya–Watson Estimator
Let $K(\cdot)$ denote a kernel function, twice continuously differentiable, such that

$$\int K(u)\,du = 1, \qquad \int u\,K(u)\,du = 0, \qquad \int u^2 K(u)\,du = \mu_2(K) < \infty. \tag{2}$$
Further, let the smoothing weight with bandwidth $h$ be

$$W_{hi}(x) = \frac{K\!\left(\frac{x - X_i}{h}\right)}{\sum_{j=1}^{n} K\!\left(\frac{x - X_j}{h}\right)}. \tag{3}$$
Assuming a model of the form specified in (1), the Nadaraya–Watson estimator of $m(x)$ is then given by

$$\hat m_{NW}(x) = \sum_{i=1}^{n} W_{hi}(x)\, Y_i = \frac{\sum_{i=1}^{n} K\!\left(\frac{x - X_i}{h}\right) Y_i}{\sum_{j=1}^{n} K\!\left(\frac{x - X_j}{h}\right)}. \tag{4}$$
Incorporating the Gaussian kernel function $K(u) = \frac{1}{\sqrt{2\pi}}\exp\!\left(-\frac{u^2}{2}\right)$, the estimator takes the value

$$\hat m_{NW}(x) = \frac{\sum_{i=1}^{n} \exp\!\left(-\frac{(x - X_i)^2}{2h^2}\right) Y_i}{\sum_{j=1}^{n} \exp\!\left(-\frac{(x - X_j)^2}{2h^2}\right)}. \tag{5}$$
This gives a way of estimating the nonsample values of $y$ given the auxiliary value $x$. The nonparametric estimator of the finite population total is thus given by

$$\hat T = \sum_{i \in s} Y_i + \sum_{j \in \bar s} \hat m(X_j), \tag{6}$$

where $s$ denotes the set of sampled units and $\bar s$ the nonsampled units.
The estimator in (6) was first suggested by Dorfman [2]. For a kernel regression estimator, the estimate of $m$ at a point $x$ is obtained using a weighted function of the observations in the $h$-neighborhood of $x$; the weight given to each observation in the neighborhood depends on the choice of kernel function.

For example, a uniform kernel assigns the same weight to all points within its window, while the biweight kernel assigns more weight to the points closest to the target and diminishing weight to points “farther away” from the center of the kernel.
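As an illustration of this weighting scheme, here is a minimal R sketch of the Nadaraya–Watson estimator in (4) with the Gaussian kernel of (5), together with the population-total estimator of (6); the function names and the simulated data are illustrative assumptions, not taken from the paper.

```r
# Nadaraya-Watson estimate of m at each point in x0 (Gaussian kernel)
nw <- function(x0, x, y, h) {
  sapply(x0, function(t) {
    w <- dnorm((t - x) / h)   # kernel weights over the h-neighborhood of t
    sum(w * y) / sum(w)       # locally constant (weighted-average) fit, Eq. (4)
  })
}

# Finite population total as in (6): observed y's plus predicted nonsample y's
set.seed(1)
x_s  <- runif(50)
y_s  <- sin(2 * pi * x_s) + rnorm(50, sd = 0.2)   # illustrative sample
x_ns <- runif(200)                                # nonsampled auxiliary values
T_hat <- sum(y_s) + sum(nw(x_ns, x_s, y_s, h = 0.05))
```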
Choosing an appropriate kernel and a suitable bandwidth is quite important in nonparametric regression. It is known that the two do not have the same effect on the estimate: previous studies reveal that the choice of kernel function has the least impact, while bandwidth selection plays the more crucial part in obtaining good estimates [18].
3.2.1. Properties of the Nadaraya–Watson estimator
The bias term of the Nadaraya–Watson estimator can be shown to be

$$\operatorname{Bias}\!\left[\hat m_{NW}(x)\right] \approx \frac{h^2}{2}\,\mu_2(K)\left[m''(x) + \frac{2\,m'(x)\,f'(x)}{f(x)}\right], \tag{7}$$

where $f$ denotes the marginal density of $X$.
The variance term can be shown to be

$$\operatorname{Var}\!\left[\hat m_{NW}(x)\right] \approx \frac{\sigma^2(x)\,R(K)}{n h\, f(x)}, \qquad R(K) = \int K^2(u)\,du, \tag{8}$$

where $\sigma^2(x)$ denotes the conditional variance of $Y$ given $X = x$.
The proofs of these terms are not given in this paper; interested readers can refer to [19]. Combining (7) and (8) gives the pointwise mean square error

$$\mathrm{MSE}\!\left[\hat m_{NW}(x)\right] \approx \operatorname{Bias}^2\!\left[\hat m_{NW}(x)\right] + \operatorname{Var}\!\left[\hat m_{NW}(x)\right], \tag{9}$$

and integrating (9) over the support of $X$ (weighted by $f$) yields the asymptotic mean integrated square error

$$\mathrm{AMISE}(h) = \frac{h^4}{4}\,\mu_2^2(K) \int \left[m''(x) + \frac{2\,m'(x)\,f'(x)}{f(x)}\right]^2 f(x)\,dx + \frac{R(K)}{nh}\int \sigma^2(x)\,dx. \tag{10}$$

A critical look at these properties shows that the bias grows with the bandwidth $h$ while the variance shrinks with it. This implies that a larger value of $h$ increases the bias but at the same time reduces the variance; the opposite is true for a small bandwidth. See Figure 1 for an illustration. It follows that an optimal bandwidth, one that minimizes the AMISE criterion, is required.
3.2.2. Optimal bandwidth of the Nadaraya–Watson estimator
The bandwidth that optimizes the Nadaraya–Watson estimator can be found by minimizing the AMISE function given in (10) with respect to $h$. The first derivative is

$$\frac{\partial\,\mathrm{AMISE}(h)}{\partial h} = h^3\,\mu_2^2(K)\int \left[m''(x) + \frac{2\,m'(x)\,f'(x)}{f(x)}\right]^2 f(x)\,dx - \frac{R(K)}{n h^2}\int \sigma^2(x)\,dx. \tag{11}$$

Equating this to zero and solving for $h$ yields the optimal bandwidth

$$h_{opt} = \left[\frac{R(K)\int \sigma^2(x)\,dx}{\mu_2^2(K)\int \left[m''(x) + \frac{2\,m'(x)\,f'(x)}{f(x)}\right]^2 f(x)\,dx}\right]^{1/5} n^{-1/5}. \tag{12}$$
3.3. Review of the Local Polynomial Regression Estimator
Let $(X_1, Y_1), (X_2, Y_2), \ldots, (X_n, Y_n)$ be a random sample of bivariate data taken from a finite population. From the model in (1), to estimate the unknown function $m(X_i)$, the procedure below is followed.
For $X_i$ in a neighborhood of a target point $x$, $m(X_i)$ can be approximated using a Taylor series as

$$m(X_i) \approx m(x) + m'(x)(X_i - x) + \cdots + \frac{m^{(p)}(x)}{p!}(X_i - x)^p \equiv \sum_{j=0}^{p} \beta_j (X_i - x)^j. \tag{13}$$
This gives the weighted least squares problem

$$\min_{\beta_0, \ldots, \beta_p}\; \sum_{i=1}^{n}\left\{Y_i - \sum_{j=0}^{p} \beta_j (X_i - x)^j\right\}^2 K\!\left(\frac{X_i - x}{h}\right), \tag{14}$$

having the kernel weights $K\!\left(\frac{X_i - x}{h}\right)$.
So that, letting

$$\mathbf{X} = \begin{pmatrix} 1 & X_1 - x & \cdots & (X_1 - x)^p \\ \vdots & \vdots & & \vdots \\ 1 & X_n - x & \cdots & (X_n - x)^p \end{pmatrix}, \qquad \mathbf{W} = \operatorname{diag}\!\left\{K\!\left(\tfrac{X_i - x}{h}\right)\right\}_{i=1}^{n}, \qquad \mathbf{Y} = (Y_1, \ldots, Y_n)^{\top},$$

the minimizer of (14) is

$$\hat{\boldsymbol\beta} = \left(\mathbf{X}^{\top}\mathbf{W}\mathbf{X}\right)^{-1}\mathbf{X}^{\top}\mathbf{W}\,\mathbf{Y}, \qquad \hat m(x) = \hat\beta_0. \tag{15}$$
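A minimal R sketch of the local linear fit (the case $p = 1$ of (14)–(15)): at each target point, the responses are regressed on $(X_i - x)$ with kernel weights, and the fitted intercept is the estimate. The function name is an illustrative assumption.

```r
# Local linear estimate of m at each point in x0 (Gaussian kernel weights)
ll <- function(x0, x, y, h) {
  sapply(x0, function(t) {
    w   <- dnorm((t - x) / h)              # diagonal entries of W in (15)
    fit <- lm(y ~ I(x - t), weights = w)   # weighted LS of y on (x - t), Eq. (14)
    unname(coef(fit)[1])                   # intercept beta0-hat = m-hat(t)
  })
}
```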
The degree of the local polynomial regression affects the estimate in terms of how it addresses the boundary and interior biases, as well as its effect on the variance. Polynomials of higher degree, such as the cubic, are known to reduce the interior bias more than the local linear fit, but at the cost of higher variance; the same polynomials are, however, poorer at the boundary. It is also known that odd-degree fits dominate even-degree ones. From this research it can be concluded that it suffices to keep the order of the local polynomial regression low and to concentrate on adjusting the bandwidth; see Härdle et al. [15] and Avery [21], among others. Because of this, the present study confines itself to local linear regression estimation, obtained from local polynomial regression of degree one. It is of interest to note that the Nadaraya–Watson estimator is the special case of the local polynomial regression estimator with the degree kept at zero.
3.3.1. The properties of local linear regression estimator
In this paper the properties of this estimator are merely stated; more detailed theoretical derivations can be found in [15].
The bias of the local linear regression estimator is given by

$$\operatorname{Bias}\!\left[\hat m_{LL}(x)\right] \approx \frac{h^2}{2}\,\mu_2(K)\, m''(x), \tag{16}$$

which, unlike (7), does not involve the term $m'(x) f'(x)/f(x)$; this design-adaptive property is what frees the local linear estimator from the boundary bias.
The conditional variance is given by

$$\operatorname{Var}\!\left[\hat m_{LL}(x)\right] \approx \frac{\sigma^2(x)\,R(K)}{n h\, f(x)},$$

the same first-order variance as that of the Nadaraya–Watson estimator in (8).
Thus the MSE of $\hat m_{LL}(x)$ is

$$\mathrm{MSE}\!\left[\hat m_{LL}(x)\right] \approx \frac{h^4}{4}\,\mu_2^2(K)\left[m''(x)\right]^2 + \frac{\sigma^2(x)\,R(K)}{n h\, f(x)}. \tag{17}$$
Integrating the MSE in (17) gives the MISE of the local linear estimator,

$$\mathrm{MISE}(h) = \int \mathrm{MSE}\!\left[\hat m_{LL}(x)\right] f(x)\,dx. \tag{18}$$
Hence the asymptotic MISE is

$$\mathrm{AMISE}(h) = \frac{h^4}{4}\,\mu_2^2(K)\int \left[m''(x)\right]^2 f(x)\,dx + \frac{R(K)}{nh}\int \sigma^2(x)\,dx. \tag{19}$$
3.3.2. The optimal bandwidth for the local linear regression estimator
As with the local constant estimator, the optimal bandwidth for the local linear estimator is obtained by minimizing the expression in (19), setting

$$\frac{\partial\,\mathrm{AMISE}(h)}{\partial h} = h^3\,\mu_2^2(K)\int \left[m''(x)\right]^2 f(x)\,dx - \frac{R(K)}{n h^2}\int \sigma^2(x)\,dx = 0. \tag{20}$$
Solving this for the bandwidth $h$ gives

$$h_{opt} = \left[\frac{R(K)\int \sigma^2(x)\,dx}{\mu_2^2(K)\int \left[m''(x)\right]^2 f(x)\,dx}\right]^{1/5} n^{-1/5}. \tag{21}$$
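Substituting (21) back into (19) gives the familiar nonparametric rate of convergence. Writing $C_1 = \mu_2^2(K)\int [m''(x)]^2 f(x)\,dx$ and $C_2 = R(K)\int \sigma^2(x)\,dx$, a short calculation (standard algebra, recorded here for completeness) gives:

```latex
\mathrm{AMISE}(h) = \tfrac{h^4}{4} C_1 + \tfrac{C_2}{nh},
\qquad h_{opt} = \left(\tfrac{C_2}{C_1}\right)^{1/5} n^{-1/5},
\qquad
\mathrm{AMISE}(h_{opt})
  = \tfrac{1}{4} C_1^{1/5} C_2^{4/5}\, n^{-4/5} + C_1^{1/5} C_2^{4/5}\, n^{-4/5}
  = \tfrac{5}{4}\, C_1^{1/5} C_2^{4/5}\, n^{-4/5}.
```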
The application of these optimal bandwidths can generally be challenging because a number of unknown functions must be estimated first, such as the marginal density $f$, the conditional variance $\sigma^2(x)$, and the second derivative $m''(x)$. Data-driven bandwidth selectors are therefore commonly used in their place.
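The sketch below uses leave-one-out least-squares cross-validation with the `ll()` function from the earlier sketch; this particular selector is an assumption of mine, a common practical substitute rather than necessarily the one used in the paper.

```r
# Leave-one-out cross-validation score for the local linear estimator ll()
cv_score <- function(h, x, y) {
  errs <- sapply(seq_along(x), function(i) {
    y[i] - ll(x[i], x[-i], y[-i], h)   # predict the i-th point from the rest
  })
  mean(errs^2)
}

# Grid search for the CV-optimal bandwidth (reusing x_s, y_s from above)
h_grid <- seq(0.02, 0.5, length.out = 40)
h_cv   <- h_grid[which.min(sapply(h_grid, cv_score, x = x_s, y = y_s))]
```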
4. EMPIRICAL STUDY
Data simulation was done using the model in (1), $Y_i = m(X_i) + e_i$, with the mean function $m$ taking in turn each of the five forms listed in Table 1. The resulting AMISE values and optimal bandwidths for the two estimators are reported in Tables 2–7.
| Model | Equation |
|---|---|
| 1: Cubic | |
| 2: Bump | |
| 3: Quadratic | |
| 4: Linear | |
| 5: Exponential | |

Table 1. Models used in simulation.
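A compact version of the comparison can be sketched as follows, reusing `nw()` and `ll()` from the sketches in Section 3. The mean function, noise level, and bandwidth here are illustrative assumptions, since the paper's exact model equations in Table 1 are not reproduced above.

```r
# Simulate from model (1) and compare NW and LL by empirical integrated squared error
set.seed(42)
m <- function(x) 1 + 2 * (x - 0.5)^2            # illustrative quadratic mean function
n <- 100
x <- sort(runif(n))
y <- m(x) + rnorm(n, sd = 0.1)

grid <- seq(0, 1, length.out = 201)
h    <- 0.08                                     # fixed bandwidth for illustration
ise  <- function(fit) mean((fit - m(grid))^2)    # approximates integrated squared error
c(NW = ise(nw(grid, x, y, h)), LL = ise(ll(grid, x, y, h)))
# The LL fit is expected to do markedly better near the endpoints 0 and 1
```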
| Dataset | Nature of Boundary Bias | Estimator | AMISE | h_AMISE |
|---|---|---|---|---|
| Faithful dataset | No boundary correction | NW | 0.00027040 | 7.781472 |
| Faithful dataset | Boundary corrected | LL | 0.00024807 | 7.375393 |
| Forbes dataset | No boundary correction | NW | 0.00075248 | 2.278702 |
| Forbes dataset | Boundary corrected | LL | 0.00043041 | 3.537160 |
| Iris dataset | No boundary correction | NW | 0.01650655 | 0.093106 |
| Iris dataset | Boundary corrected | LL | 0.00141712 | 1.083385 |

Table 2. Asymptotic mean integrated square error (AMISE) and corresponding optimal bandwidth (h_AMISE) for the standard Nadaraya–Watson (NW) kernel estimator and the local linear (LL) regression estimator, computed for various selected datasets.
5. CONCLUSION AND RECOMMENDATION
From the results of the simulated data presented in the respective figures, it is clear that the local linear estimator does not induce boundary bias into its estimate, as the Nadaraya–Watson estimator does. In addition, Figure 4 revealed that while increasing the degree of the local polynomial seems to reduce the interior bias, this comes at the cost of higher variance; higher-degree polynomials also do not perform as adequately at the boundary as the local linear fit. The superior performance of the local linear estimator also manifested itself in the figures obtained using various built-in datasets in R (see Table 2). The local linear estimator can therefore be recommended for removal of boundary bias. If, however, the problem is interior bias, then proper adjustment of the bandwidth may suffice in some cases, since the gains made by moving to a higher-order degree are insignificant. It is also clear that in both cases the bandwidth plays a significant role as a smoothing parameter, and its proper selection should be borne in mind during estimation. These observations are supported by the overall smaller AMISE values of the local linear estimator, which proved superior in all aspects to the Nadaraya–Watson estimator, as seen in the results given in Tables 2–7.
| Sample Size | Nature of Boundary Bias | Estimator | AMISE | h_AMISE |
|---|---|---|---|---|
| n = 25 | No boundary correction | NW | 0.001677 | 1.169522 |
| n = 25 | Boundary corrected | LL | 0.001348 | 1.201303 |
| n = 50 | No boundary correction | NW | 0.001721 | 1.063632 |
| n = 50 | Boundary corrected | LL | 0.001402 | 1.206711 |
| n = 100 | No boundary correction | NW | 0.001401 | 1.211507 |
| n = 100 | Boundary corrected | LL | 0.001119 | 1.453101 |
| n = 500 | No boundary correction | NW | 0.001291 | 1.316334 |
| n = 500 | Boundary corrected | LL | 0.001218 | 1.375686 |
| n = 1000 | No boundary correction | NW | 0.001254 | 1.353020 |
| n = 1000 | Boundary corrected | LL | 0.001200 | 1.398421 |

Table 3. Asymptotic mean integrated square error (AMISE) for the standard Nadaraya–Watson (NW) kernel estimator and the local linear (LL) regression estimator (Model 1: Cubic function).
| Sample Size | Nature of Boundary Bias | Estimator | AMISE | h_AMISE |
|---|---|---|---|---|
| n = 25 | No boundary correction | NW | 0.005367 | 0.288393 |
| n = 25 | Boundary corrected | LL | 0.004966 | 0.309645 |
| n = 50 | No boundary correction | NW | 0.005475 | 0.305584 |
| n = 50 | Boundary corrected | LL | 0.004082 | 0.375665 |
| n = 100 | No boundary correction | NW | 0.005138 | 0.309597 |
| n = 100 | Boundary corrected | LL | 0.003956 | 0.384638 |
| n = 500 | No boundary correction | NW | 0.004921 | 0.340515 |
| n = 500 | Boundary corrected | LL | 0.003893 | 0.394235 |
| n = 1000 | No boundary correction | NW | 0.004774 | 0.347761 |
| n = 1000 | Boundary corrected | LL | 0.003845 | 0.399115 |

Table 4. Asymptotic mean integrated square error (AMISE) for the standard Nadaraya–Watson (NW) kernel estimator and the local linear (LL) regression estimator (Model 2: Bump function).
| Sample Size | Nature of Boundary Bias | Estimator | AMISE | h_AMISE |
|---|---|---|---|---|
| n = 25 | No boundary correction | NW | 0.002797 | 0.576798 |
| n = 25 | Boundary corrected | LL | 0.002641 | 0.609480 |
| n = 50 | No boundary correction | NW | 0.002446 | 0.685321 |
| n = 50 | Boundary corrected | LL | 0.002169 | 0.740991 |
| n = 100 | No boundary correction | NW | 0.002288 | 0.716167 |
| n = 100 | Boundary corrected | LL | 0.002135 | 0.759066 |
| n = 500 | No boundary correction | NW | 0.002085 | 0.787549 |
| n = 500 | Boundary corrected | LL | 0.002040 | 0.799239 |
| n = 1000 | No boundary correction | NW | 0.002057 | 0.797407 |
| n = 1000 | Boundary corrected | LL | 0.002029 | 0.804035 |

Table 5. Asymptotic mean integrated square error (AMISE) for the standard Nadaraya–Watson (NW) kernel estimator and the local linear (LL) regression estimator (Model 3: Quadratic function).
| Sample Size | Nature of Boundary Bias | Estimator | AMISE | h_AMISE |
|---|---|---|---|---|
| n = 25 | No boundary correction | NW | 0.005368 | 0.2883934 |
| n = 25 | Boundary corrected | LL | 0.004966 | 0.3096449 |
| n = 50 | No boundary correction | NW | 0.004253 | 0.3672427 |
| n = 50 | Boundary corrected | LL | 0.004097 | 0.3749881 |
| n = 100 | No boundary correction | NW | 0.004018 | 0.3782936 |
| n = 100 | Boundary corrected | LL | 0.003928 | 0.3845499 |
| n = 500 | No boundary correction | NW | 0.003970 | 0.3903203 |
| n = 500 | Boundary corrected | LL | 0.003889 | 0.3943235 |
| n = 1000 | No boundary correction | NW | 0.003943 | 0.3958703 |
| n = 1000 | Boundary corrected | LL | 0.003845 | 0.3990102 |

Table 6. Asymptotic mean integrated square error (AMISE) for the standard Nadaraya–Watson (NW) kernel estimator and the local linear (LL) regression estimator (Model 4: Linear function).
| Sample Size | Nature of Boundary Bias | Estimator | AMISE | h_AMISE |
|---|---|---|---|---|
| n = 25 | No boundary correction | NW | 0.647323 | 0.0029467 |
| n = 25 | Boundary corrected | LL | 0.613482 | 0.0031301 |
| n = 50 | No boundary correction | NW | 0.723391 | 0.0025802 |
| n = 50 | Boundary corrected | LL | 0.635071 | 0.0030013 |
| n = 100 | No boundary correction | NW | 0.749663 | 0.0027027 |
| n = 100 | Boundary corrected | LL | 0.697904 | 0.0029576 |
| n = 500 | No boundary correction | NW | 0.673946 | 0.0030442 |
| n = 500 | Boundary corrected | LL | 0.646734 | 0.0031898 |
| n = 1000 | No boundary correction | NW | 0.699661 | 0.0030770 |
| n = 1000 | Boundary corrected | LL | 0.648226 | 0.0031890 |

Table 7. Asymptotic mean integrated square error (AMISE) for the standard Nadaraya–Watson (NW) kernel estimator and the local linear (LL) regression estimator (Model 5: Exponential function).
CONFLICT OF INTEREST
The author declares no conflict of interest.
AUTHORS' CONTRIBUTIONS
The author elaborated on the source of the bias in the Nadaraya–Watson estimator and on how the local linear estimator addresses the problem.
Funding Statement
I have solely funded this research; the article is entirely my own work.
ACKNOWLEDGMENTS
I wish to acknowledge my parents and family members for their support and encouragement. I also appreciate my colleagues at the Department for challenging me to carry out this research.
REFERENCES