1. Introduction
In his groundbreaking work [1,2], particularly Chapters 2-5 of [2], Anastassiou first established quantitative approximation rates for neural networks approximating continuous functions. He achieved this through specialized Cardaliaguet-Euvrard and "squashing" operators, deriving convergence rates using the modulus of continuity of target functions (and their higher-order derivatives) while proving sharp Jackson-type inequalities. Both univariate and multivariate cases received rigorous treatment, with the defining kernels of these operators - "bell-shaped" and "squashing" - assumed to have compact support for strong localization and stability.
Building upon this foundation and inspired by Chen and Cao's work [5], the author extended this research by introducing and analyzing quasi-interpolation operators activated by sigmoidal and hyperbolic tangent functions. This culminated in a comprehensive treatment of univariate, multivariate, and fractional cases [3,4], establishing a robust framework for studying new families of neural network operators with enhanced convergence and stability properties.
This paper advances this framework by developing a class of multivariate symmetrized and perturbed hyperbolic tangent-activated neural network operators and establishing their Voronovskaya-type asymptotic expansions for differentiable mappings f ∈ C^m(ℝ^N, ℝ), where m, N ∈ ℕ. The proposed symmetrization, combined with parametric deformation of the hyperbolic tangent function, enhances both the approximation capability and regularity of the resulting operators. This yields a more precise asymptotic description of their behavior as network size increases, revealing new structural properties relevant to high-dimensional approximation problems.
For recent related developments in neural network approximation theory, see [8,9] and references therein. The classical works [6,7] remain fundamental for a comprehensive introduction to neural networks and their architectures.
We now formalize the multilayer feed-forward structure considered in this study. Let L denote the number of hidden layers. For an input vector x ∈ ℝ^d with d ∈ ℕ, we define weight vectors w_j ∈ ℝ^d, coefficients θ_j ∈ ℝ, and biases b_j ∈ ℝ for j = 1, …, s, where s represents the number of neurons per layer.
Using ⟨·,·⟩ to denote the Euclidean inner product, the activation at node j is given by σ(⟨w_j, x⟩ + b_j). The network output at the first layer is then:
Higher-level compositions can be defined recursively. For example, the second-level composition is:
More generally, for any level, we define:
This recursive structure captures the essence of multilayer feed-forward networks. The specific choice of activation function, along with the weight and bias distributions, determines the approximation properties analyzed in subsequent sections.
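For readers who prefer a computational view, the following Python sketch mirrors the recursive layer composition just described. It is only an illustration of the structure: the activation, the layer width s, and the random weights and biases are placeholder choices, not the specific operators analyzed in the remainder of the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

def layer(x, W, b, theta, sigma):
    """One hidden level: sum_j theta_j * sigma(<w_j, x> + b_j)."""
    return theta @ sigma(W @ x + b)

def feedforward(x, levels, sigma):
    """Recursive composition: the scalar output of each level feeds the next."""
    out = np.asarray(x, dtype=float)
    for W, b, theta in levels:
        out = np.atleast_1d(layer(out, W, b, theta, sigma))
    return float(out[0])

# Illustrative two-level network: s = 5 neurons per level, input in R^3.
s, d = 5, 3
levels = [(rng.normal(size=(s, d)), rng.normal(size=s), rng.normal(size=s)),
          (rng.normal(size=(s, 1)), rng.normal(size=s), rng.normal(size=s))]
print(feedforward(rng.normal(size=d), levels, np.tanh))
```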
2. Mathematical Formulations
Following the framework established in [4], we define the perturbed hyperbolic tangent activation function as:
Here, λ serves as a scaling parameter, while q acts as a deformation coefficient. For a comprehensive discussion, see Chapter 18 of [4], titled "q-Deformed and λ-Parameterized Hyperbolic Tangent-Based Banach Space-Valued Neural Network Approximation".
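As a hedged illustration only, the following Python sketch assumes the standard q-deformed, λ-parameterized form g_{q,λ}(x) = (e^{λx} − q e^{−λx}) / (e^{λx} + q e^{−λx}) discussed in [4]. Under this assumption, q = 1 recovers tanh(λx), and the function coincides with the shifted form tanh(λx − (ln q)/2), which the later sketches use as an overflow-safe rewriting.

```python
import numpy as np

def g(x, q=1.0, lam=1.0):
    """q-deformed, lambda-parameterized hyperbolic tangent (assumed form, cf. [4])."""
    ep, em = np.exp(lam * x), q * np.exp(-lam * x)
    return (ep - em) / (ep + em)

x = np.linspace(-4.0, 4.0, 9)
print(np.allclose(g(x, q=1.0, lam=1.0), np.tanh(x)))        # q = 1 recovers tanh(lam * x)
print(np.allclose(g(x, q=2.0, lam=1.5),
                  np.tanh(1.5 * x - 0.5 * np.log(2.0))))    # equivalent shifted-tanh form
print(g(x, q=2.0, lam=1.5))                                 # deformed, no longer odd in x
```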
Symmetrization Method
We implement a half-data feed strategy for our multivariate neural networks by defining the following density-type kernel:
This kernel is positive for all real arguments and exhibits the following symmetry relations:
By summing these expressions, we obtain:
This allows us to define the even (symmetric) function:
2.1. Key Properties and Extremal Values
The analysis of the two kernel functions reveals several fundamental properties that are crucial for understanding their behavior and applications in neural network approximation theory. Most notably, these functions attain their global maximum values at symmetric points, as established in [4].
Theorem 1
(Extremal Values of Kernel Functions).
For the deformation parameter q > 0 and scaling parameter λ > 0, the two kernel functions satisfy the following extremal property:
This result demonstrates that:
- 1.
The functions are symmetric with respect to the origin when q = 1.
- 2.
For q ≠ 1, the functions attain their maximum values at two points located symmetrically about the origin.
- 3.
The maximum value is independent of the deformation parameter q and depends only on the scaling parameter λ.
This symmetry and extremal property play a fundamental role in the construction of the symmetric kernel and in establishing the approximation properties of the resulting neural network operators.
Remark 1.
The common maximum represents the peak amplitude of both kernel functions. As λ increases, this maximum value approaches its limiting value, corresponding to the case where the deformed hyperbolic tangent approaches a step function. This observation connects our deformed kernels to the classical sigmoidal activation functions used in neural networks.
Proof. The proof follows directly from the definition of the kernel in (5) and the properties of the deformed hyperbolic tangent function defined in (4). A similar calculation applies to the second kernel, completing the proof. □
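The symmetry and extremal behavior of Theorem 1 are easy to probe numerically. The sketch below relies on assumed forms inferred from the surrounding description, not quoted from the text: a density-type kernel, here named Ψ as a placeholder, built as a scaled difference of shifted deformed tangents, and the symmetrized kernel taken as the average of the q- and 1/q-deformed versions.

```python
import numpy as np

def g(x, q, lam):
    # Overflow-safe form of the assumed deformed hyperbolic tangent.
    return np.tanh(lam * x - 0.5 * np.log(q))

def psi(x, q, lam):
    # Assumed density-type kernel: scaled difference of shifted deformed tangents.
    return 0.25 * (g(x + 1.0, q, lam) - g(x - 1.0, q, lam))

def phi(x, q, lam):
    # Assumed symmetrized kernel: average of the q- and 1/q-deformed kernels.
    return 0.5 * (psi(x, q, lam) + psi(x, 1.0 / q, lam))

q, lam = 2.0, 1.5
x = np.linspace(-6.0, 6.0, 24001)
print(np.allclose(phi(x, q, lam), phi(-x, q, lam)))                 # Phi is even
i, j = np.argmax(psi(x, q, lam)), np.argmax(psi(x, 1.0 / q, lam))
print(x[i], x[j])                                  # maxima located symmetrically about 0
print(psi(x, q, lam)[i], psi(x, 1.0 / q, lam)[j])  # equal peak values (q-independent)
```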
2.2. Partition of Unity Property
A fundamental property of these kernel functions is their ability to form a partition of unity, which plays a crucial role in approximation theory and in the construction of neural network operators. This property is formally established in the following theorem:
Theorem 2
(Partition of Unity for Deformed Hyperbolic Tangent Kernels).
For fixed parameters q > 0 and λ > 0, both kernel functions satisfy the partition of unity property:
As a direct consequence, the symmetrized kernel defined in (8) also satisfies:
Proof. The proof follows from the specific construction of the kernels and their properties:
- 1.
The function defined in (4) is a deformed hyperbolic tangent that approaches 1 as its argument tends to +∞ and −1 as its argument tends to −∞.
- 2.
The kernel is constructed as a difference of shifted versions of the deformed hyperbolic tangent, specifically:
- 3.
As the argument tends to infinity, the kernel decays to zero exponentially fast due to the properties of the deformed hyperbolic tangent.
- 4.
The sum over integer translates forms a telescoping series that converges to 1 for all real arguments, due to the specific construction and the behavior of the deformed hyperbolic tangent at infinity.
- 5.
The same argument applies to the reciprocal-parameter kernel due to the symmetry relation established in (6).
- 6.
The result for the symmetrized kernel follows directly from its definition as the average of the two kernels.
For a complete rigorous proof, see [4]. □
Corollary 1
(Multivariate Partition of Unity).
The multivariate kernel Z (see (31) below) satisfies the following partition of unity property in ℝ^N:
where x = (x_1, …, x_N) ∈ ℝ^N and k = (k_1, …, k_N) ∈ ℤ^N.
Proof. This follows directly from the univariate partition of unity property (11) and the definition of Z as a product of univariate kernel functions. □
2.3. Normalization Property
A fundamental property of these kernel functions is their normalization, which ensures that they integrate to unity over the real line. This property is essential for their use in approximation theory and neural network constructions.
Theorem 3
(Normalization of Kernel Functions).
For fixed parameters q > 0 and λ > 0, both kernel functions satisfy the normalization property:
As a direct consequence, the symmetrized kernel defined in (8) is also normalized:
Proof. The normalization property follows from the construction of the kernel as a difference of shifted deformed hyperbolic tangent functions. Specifically:
- 1.
The deformed hyperbolic tangent approaches 1 as its argument tends to +∞ and −1 as its argument tends to −∞.
- 2.
The kernel is constructed as:
- 3.
The integral of the kernel over the real line can be shown to equal 1; the apparent difficulty in the term-by-term computation is resolved by the proper normalization factor in the construction of the kernel, which ensures the integral equals 1. A rigorous proof is provided in Theorem 18.2 of [4].
□
Remark 2.
The normalization property has several important implications:
It ensures that the kernels can be interpreted as probability density functions.
It guarantees that constant functions are preserved under convolution with these kernels.
It is essential for proving convergence results of approximation operators constructed from these kernels.
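Both the discrete partition of unity and the normalization can be checked numerically under the same assumed kernel forms used in the earlier sketch; the lattice window and the integration grid below are illustrative truncations, justified by the exponential decay discussed in the next subsection.

```python
import numpy as np

def g(x, q, lam):
    return np.tanh(lam * x - 0.5 * np.log(q))   # overflow-safe assumed deformed tanh

def phi(x, q=2.0, lam=1.5):
    psi = lambda t, qq: 0.25 * (g(t + 1.0, qq, lam) - g(t - 1.0, qq, lam))
    return 0.5 * (psi(x, q) + psi(x, 1.0 / q))

# Discrete partition of unity: sum_k Phi(x - k) = 1 at several sample points.
xs = np.linspace(-0.5, 0.5, 7)
k = np.arange(-15, 16)
print(max(abs(phi(x - k).sum() - 1.0) for x in xs))   # close to machine precision

# Normalization: the integral of Phi over R equals 1 (Riemann sum on a wide window).
t, h = np.linspace(-30.0, 30.0, 600001, retstep=True)
print(phi(t).sum() * h)                               # ~1.0
```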
2.4. Exponential Decay Property
An important feature of the kernel functions is their exponential decay, which contributes to the localization properties of the associated approximation operators.
Theorem 4
(Exponential Decay of Kernel Functions).
For admissible parameters q and λ, and suitable constants, the kernel functions satisfy the following exponential decay estimates:
where the constant T is defined as:
Similarly, for the reciprocal kernel:
As a direct consequence, the symmetrized kernel Φ satisfies:
Proof. To rigorously establish the exponential decay property, we proceed through the following steps:
The deformed hyperbolic tangent function exhibits the following asymptotic behavior:
with exponential convergence to these limits. Specifically, for large arguments, we have the precise estimate:
where sgn denotes the sign function.
Recall the definition of the kernel:
For sufficiently large arguments, both shifted terms share the same sign, allowing us to derive the following bound:
For fixed n and x, we partition the infinite sum into local and tail components:
The number of terms in the local sum is bounded by:
Since the kernel is uniformly bounded by 1 for all x, we obtain:
Using the exponential decay bound from (21), we estimate:
This geometric series can be bounded as follows:
For lattice points in the tail range, the exponent grows at least linearly with the distance from nx. Therefore:
Simplifying further, we obtain the desired bound:
where T is defined in (16).
The same argument applies to the reciprocal kernel due to the symmetry relation:
For the symmetrized kernel Φ, we have:
which completes the proof. □
Remark 3
(Implications of Exponential Decay). The exponential decay property established in Theorem 4 has several significant implications:
- 1.
Localization Property: The kernels and their symmetrization Φ are effectively localized. For large n, the sum is dominated by terms for which nx − k is small, enabling efficient numerical approximations.
- 2.
Numerical Stability: The exponential decay ensures that truncating the infinite sums in practical computations introduces only exponentially small errors, which is crucial for numerical stability (see the sketch following this remark).
- 3.
Convergence Analysis: This property is fundamental for establishing convergence rates of the associated approximation operators. It allows for precise control of the tail behavior in error estimates, leading to sharp convergence results.
- 4.
Sparse Representations: In numerical implementations, the exponential decay enables efficient sparse representations of the kernel sums, significantly reducing computational complexity in many cases.
- 5.
Error Bounds: The explicit form of the decay bound (with constant T defined in (16)) provides concrete error bounds that can be used in the analysis of approximation algorithms.
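The truncation point of Remark 3 is illustrated by the following sketch: under the assumed kernel form, restricting the lattice sum to the window |nx − k| ≤ n^{1−β} discards only an exponentially small tail. The values n = 100, β = 1/2, and x = 0.3 are arbitrary.

```python
import numpy as np

def g(x, q, lam):
    return np.tanh(lam * x - 0.5 * np.log(q))   # overflow-safe assumed deformed tanh

def phi(x, q=2.0, lam=1.5):
    psi = lambda t, qq: 0.25 * (g(t + 1.0, qq, lam) - g(t - 1.0, qq, lam))
    return 0.5 * (psi(x, q) + psi(x, 1.0 / q))

n, beta, x = 100, 0.5, 0.3
k = np.arange(-5 * n, 5 * n + 1)                  # effectively the full lattice
w = phi(n * x - k)
near = np.abs(n * x - k) <= n ** (1.0 - beta)     # local window of radius n^(1-beta) = 10
print(w.sum())                                    # full sum: ~1 (partition of unity)
print(abs(w.sum() - w[near].sum()))               # truncation error: exponentially small
print(near.sum(), "of", k.size, "terms retained") # only a handful of lattice points matter
```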
2.5. Multivariate Extension via Tensor Products
To generalize the univariate kernel construction to higher dimensions, we employ the tensor product approach. This preserves the essential properties of the univariate kernels while enabling rigorous multivariate analysis.
Definition 1
(Multivariate Kernel Construction).
Let Φ be the univariate kernel defined in (8). The corresponding multivariate kernel Z is defined via the tensor product:
This construction naturally extends the properties of Φ to the multivariate setting.
Theorem 5
(Fundamental Properties of the Multivariate Kernel).
Let Z be the multivariate kernel defined in (31). Then Z satisfies the following fundamental properties. It is positive, that is,
It satisfies a discrete partition of unity:
and it is dilation-invariant in the sense that, for any n ∈ ℕ,
Finally, Z is normalized:
Proof. (P1) Positivity: Each univariate factor is positive, which implies that Z is positive.
(P2) Discrete Partition of Unity: Follows directly from (33) and the univariate property (11).
(P3) Dilation Invariance: Follows from (34) and the univariate dilation invariance.
(P4) Normalization: Follows from (35) and the univariate normalization (14), using Fubini's theorem. □
Corollary 2
(Exponential Decay of the Multivariate Kernel).
Let Z be the multivariate kernel defined in (31), and equip ℝ^N with the sup-norm ‖·‖_∞. For any x ∈ ℝ^N and n ∈ ℕ such that the localization condition of Theorem 4 holds, the kernel Z exhibits exponential decay outside a sup-norm neighborhood of x:
where the constant is as defined in (16).
This result shows that the contributions of Z(nx − k) are exponentially small for lattice points k lying outside the sup-norm neighborhood of x, reflecting the strong localization of the multivariate kernel.
Proof. Let k be a lattice point outside the stated neighborhood of x. For each such k, there exists at least one index i for which the corresponding coordinate satisfies the univariate decay condition. Then:
absorbing constants into T for sufficiently large n. □
Theorem 6
(Exponential Decay of the Multivariate Kernel).
Let Z be the multivariate kernel defined in (31), and equip ℝ^N with the sup-norm ‖·‖_∞. Then, for any x ∈ ℝ^N and n ∈ ℕ satisfying the localization condition above, the kernel satisfies the exponential decay estimate
where T is the constant appearing in the univariate decay bound.
Proof. Let k be a lattice point lying outside the sup-norm neighborhood of x. For each such k, there exists at least one index i such that the corresponding coordinate satisfies the univariate decay condition. Using the tensor-product structure of Z, we have
Next, we bound the number of terms in the sum. For each fixed index i, the remaining N − 1 coordinates can take at most a polynomially bounded number of integer values. Accounting for all N choices of the index i, we obtain
Since the decay is exponential, the polynomial factor can be absorbed into the exponential by adjusting T if necessary. Therefore, we conclude
proving the claimed exponential decay. □
2.6. Neural Network Operators
Using the multivariate kernel Z, we define several types of neural network operators that serve as primary tools for function approximation in high-dimensional settings.
Let f ∈ C^m(ℝ^N), where m, N ∈ ℕ. For a multi-index α = (α_1, …, α_N) with |α| = α_1 + ⋯ + α_N, denote the partial derivative of order |α| by
The maximum norm of derivatives of order m is defined as
Let C_B(ℝ^N) denote the space of continuous and bounded functions on ℝ^N.
Definition 2
(Neural Network Operators). Let f ∈ C_B(ℝ^N), n ∈ ℕ, and x ∈ ℝ^N. Using the multivariate kernel Z, we define the following classes of neural network operators, which generalize classical approximation operators to the multivariate setting:
1. Quasi-interpolation operator:
The quasi-interpolation operator approximates f by evaluating it at the scaled integer lattice points k/n and weighting these evaluations by the kernel:
This operator is simple and computationally efficient, relying only on pointwise values of f. Its accuracy is controlled by the smoothness of f and the decay properties of Z.
2. Kantorovich-type operator:
The Kantorovich operator generalizes the quasi-interpolation operator by replacing pointwise evaluations with local integrals over small hypercubes of side length 1/n:
This construction improves approximation properties for functions that are less regular or only integrable, and it preserves linear functionals, making it suitable for analysis in L^p spaces.
3. Quadrature-type operator:
To further enhance flexibility and numerical implementation, one can define local quadrature approximations of the integrals in . Let
where specifies the number of quadrature points in each dimension, indexes the points, and are the corresponding quadrature weights satisfying . The quadrature-type operator is then defined as
This operator interpolates between pointwise and integral-based approximations, providing a practical scheme for high-dimensional problems and allowing for flexible choice of quadrature rules.
Remarks:
- 1.
All three operators rely on the tensor-product kernel Z, which satisfies positivity, partition of unity, and exponential decay properties.
- 2.
The quasi-interpolation operator is computationally simplest, the Kantorovich operator is more robust for rough functions, and the quadrature operator allows numerical integration with controlled accuracy (all three are sketched in code below).
- 3.
These operators provide the foundation for convergence analysis in the pointwise, uniform, and L^p senses, as shown in subsequent theorems.
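To make the three constructions concrete, the following univariate sketch implements quasi-interpolation, Kantorovich-type, and quadrature-type operators built on the assumed kernel from the earlier sketches (the multivariate case simply replaces the univariate kernel by the tensor product Z). The truncation radius, the midpoint rule for the cell averages, the quadrature nodes and weights, and the test function are all illustrative choices.

```python
import numpy as np

def g(x, q, lam):
    return np.tanh(lam * x - 0.5 * np.log(q))   # overflow-safe assumed deformed tanh

def phi(x, q=2.0, lam=1.5):
    psi = lambda t, qq: 0.25 * (g(t + 1.0, qq, lam) - g(t - 1.0, qq, lam))
    return 0.5 * (psi(x, q) + psi(x, 1.0 / q))

def lattice(x, n, K=40):
    return np.arange(int(n * x) - K, int(n * x) + K + 1)

def quasi(f, x, n):
    """Quasi-interpolation: sum_k f(k/n) Phi(n x - k)."""
    k = lattice(x, n)
    return np.sum(f(k / n) * phi(n * x - k))

def kantorovich(f, x, n, m=64):
    """Kantorovich-type: pointwise values replaced by averages over [k/n, (k+1)/n]."""
    k = lattice(x, n)
    t = (np.arange(m) + 0.5) / (m * n)                     # midpoint grid on the cell
    avg = np.array([np.mean(f(kj / n + t)) for kj in k])   # n * integral over the cell
    return np.sum(avg * phi(n * x - k))

def quadrature(f, x, n, theta=2):
    """Quadrature-type: weighted point evaluations with weights summing to 1."""
    k = lattice(x, n)
    r = np.arange(1, theta + 1) / (theta * n)              # illustrative nodes in (0, 1/n]
    w = np.full(theta, 1.0 / theta)
    vals = np.array([np.sum(w * f(kj / n + r)) for kj in k])
    return np.sum(vals * phi(n * x - k))

f, x0 = np.sin, 0.4
for n in (10, 100, 1000):
    print(n, quasi(f, x0, n) - f(x0),
             kantorovich(f, x0, n) - f(x0),
             quadrature(f, x0, n) - f(x0))
```

On a typical run the printed differences shrink as n grows, consistent with the convergence results stated next.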
Theorem 7
(Convergence of Multivariate Neural Network Operators).
Let f ∈ C^m(ℝ^N) with bounded derivatives up to order m, and let the quasi-interpolation, Kantorovich-type, and quadrature-type operators be as defined in (48)–(51). Then, for each x ∈ ℝ^N, we have the following convergence results:
with the quantitative error estimates
where the constant depends only on the kernel Z and the dimension N.
Proof. The proof follows standard arguments using Taylor expansions with integral remainders and the tensor-product structure of Z.
For the quasi-interpolation operator, write
where the last term is the remainder. Summing against the kernel and using its moment properties yields
For the Kantorovich operator, the integral averages satisfy
where the remainder is again bounded by the same quantity.
Similarly, for the quadrature operator, the local quadrature averages approximate the integrals to the stated order under standard assumptions on the weights.
Combining these estimates for all operators concludes the proof. □
Theorem 8
(Fundamental Properties of Neural Network Operators).
Let f, g ∈ C_B(ℝ^N) and let two real scalars be given. For all n ∈ ℕ and x ∈ ℝ^N, each operator defined in (48)–(51) satisfies the following properties:
The operators are linear, i.e.,
They are positive: if f(x) ≥ 0 for all x ∈ ℝ^N, then
Moreover, they reproduce constants: if f is a constant function, then
Proof. Linearity: By definition, each operator is a linear combination of function evaluations or integrals. For the quasi-interpolation operator:
Analogous arguments hold for the Kantorovich and quadrature operators, using linearity of integrals and sums.
Positivity: Since the kernel Z is positive and the quadrature weights are nonnegative, the output is nonnegative for any nonnegative f.
Reproduction of Constants: If f is constant, then
where we used the partition of unity property of Z. □
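Reproduction of constants and positivity can also be confirmed directly on the sketched quasi-interpolation operator; as before, the kernel form and all parameters are assumptions carried over from the earlier sketches.

```python
import numpy as np

def g(x, q, lam):
    return np.tanh(lam * x - 0.5 * np.log(q))   # overflow-safe assumed deformed tanh

def phi(x, q=2.0, lam=1.5):
    psi = lambda t, qq: 0.25 * (g(t + 1.0, qq, lam) - g(t - 1.0, qq, lam))
    return 0.5 * (psi(x, q) + psi(x, 1.0 / q))

def quasi(f, x, n, K=40):
    k = np.arange(int(n * x) - K, int(n * x) + K + 1)
    return np.sum(f(k / n) * phi(n * x - k))

c = 3.7
print(quasi(lambda t: np.full_like(t, c), 0.42, n=25))         # ~3.7: constants reproduced
print(quasi(lambda t: np.maximum(t, 0.0), -0.3, n=25) >= 0.0)  # f >= 0 implies output >= 0
```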
Remark 4
(Approximation Order). The operators defined above exhibit specific approximation properties:
-
If f ∈ C^m(ℝ^N) with bounded derivatives, then by a standard Taylor expansion of f about the grid nodes k/n, we obtain:
and similarly for the Kantorovich and quadrature-type operators, depending on the quadrature rule.
Higher-order moment conditions on Φ or higher-degree quadrature rules can improve this rate, leading to asymptotic Voronovskaya-type expansions as discussed in later sections.
Remark 5
(Interpretation of Kantorovich Operator).
The integral in (49) can also be expressed as:
indicating that the Kantorovich operator represents a shifted average of f over cubes of side length 1/n centered at the grid nodes k/n.
3. Main Results
In this section, we rigorously analyze the approximation properties of the multivariate quasi-interpolation, Kantorovich-type, and quadrature-type neural network operators by deriving Voronovskaya-type asymptotic expansions. Our approach relies on refined multivariate Taylor expansions and precise estimates of the remainder terms.
3.1. Voronovskaya-Type Expansion for Basic Operators
Theorem 9
(Voronovskaya-Type Asymptotic Expansion).
Let f ∈ C^m(ℝ^N), let n ∈ ℕ be sufficiently large, and let x ∈ ℝ^N, where m, N ∈ ℕ. Assume that all partial derivatives of order m are bounded, i.e., bounded for every multi-index α with |α| = m. Let 0 < β < 1. Then, for the quasi-interpolation neural network operator we have
which equivalently implies
Furthermore, if all partial derivatives of order between 1 and m vanish at x, then
Proof. We begin by defining, for t ∈ [0, 1], the univariate path function
By the chain rule, its j-th derivative satisfies
Using the standard multivariate Taylor formula with integral remainder:
where
Applying the operator and substituting the grid points, we obtain
where the remainder term satisfies the estimate
Consequently, for n sufficiently large,
yielding the desired asymptotic expansion (69)-(70). The case of vanishing lower-order derivatives directly implies (71). □
Remark 6.
The above theorem rigorously establishes the rate of convergence of the quasi-interpolation operator in terms of the smoothness m of f, the dimensionality N, and the parameter β. The remainder estimate is uniform with respect to x under the bounded-derivative condition, providing a clear path toward the derivation of analogous expansions for Kantorovich-type operators.
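The convergence behavior described in Theorem 9 and Remark 6 can be probed empirically. The sketch below measures the pointwise error of the (assumed-form) quasi-interpolation operator for increasing n and fits an empirical convergence order by log-log regression; the test function and all parameter choices are illustrative.

```python
import numpy as np

def g(x, q, lam):
    return np.tanh(lam * x - 0.5 * np.log(q))   # overflow-safe assumed deformed tanh

def phi(x, q=2.0, lam=1.5):
    psi = lambda t, qq: 0.25 * (g(t + 1.0, qq, lam) - g(t - 1.0, qq, lam))
    return 0.5 * (psi(x, q) + psi(x, 1.0 / q))

def quasi(f, x, n, K=60):
    k = np.arange(int(n * x) - K, int(n * x) + K + 1)
    return np.sum(f(k / n) * phi(n * x - k))

f, x0 = np.sin, 0.4
ns = np.array([20, 40, 80, 160, 320])
err = np.array([abs(quasi(f, x0, n) - f(x0)) for n in ns])
slope = np.polyfit(np.log(ns), np.log(err), 1)[0]
print(err)                         # pointwise errors for increasing n
print("empirical order:", -slope)  # fitted exponent of the observed decay in n
```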
3.2. Preparatory Observations
For any multi-index α with |α| = m, we adopt the standard combinatorial notation
and define the remainder integrand in the multivariate Taylor expansion as
which will play a central role in estimating the remainder term.
Under the uniform grid assumption
we can bound the remainder R as
Moreover, summing over all nodes with the kernel weight function yields the global estimate
These estimates complete the proof of Theorem 9 and provide a solid foundation for deriving analogous Voronovskaya-type expansions for Kantorovich-type operators.
4. Voronovskaya-Type Expansions for Kantorovich and Quasi-Interpolation Operators
We now extend the previous results to Kantorovich-type and quasi-interpolation neural network operators. Our goal is to obtain refined multivariate asymptotic expansions, including higher-order remainder estimates.
Theorem 10
(Refined Multivariate Voronovskaya Expansion).
Let f be sufficiently smooth with bounded derivatives up to the required order, and let x ∈ ℝ^N. For sufficiently large n and 0 < β < 1, the following expansion holds for the Kantorovich operator:
where the remainder satisfies the uniform estimate
with a constant independent of x and n. The same expansion holds for the quasi-interpolation operator with the corresponding weights.
Proof. Define the path function for t ∈ [0, 1]:
By the chain rule, its derivatives are
Applying the Taylor expansion with integral remainder gives
For a grid point k/n, we can write
with the remainder term
Using the grid uniformity assumption
we obtain the bound
Finally, applying the Kantorovich operator (or the quasi-interpolation operator) and summing over all grid points yields the uniform remainder estimate in (78), completing the proof. □
Remark 7
(Uniform Convergence and Higher-Order Terms). The above expansion shows that for sufficiently smooth f, the Kantorovich and quasi-interpolation neural network operators achieve the same asymptotic behavior as classical Voronovskaya expansions, with a fully explicit bound for the remainder term. This allows rigorous estimates of the rate of convergence in terms of n, β, N, and the smoothness of f.
4.1. Implications for Approximation Rates
Let f and x be as in the previous theorems. From the Voronovskaya-type expansion (85)–(88), we obtain the asymptotic estimate
which provides the precise rate of convergence in terms of the grid parameter n and the smoothness index m.
Furthermore, if all partial derivatives of order up to m vanish at x, i.e.,
then the remainder term dominates the expansion, yielding the higher-order estimate
which explicitly illustrates the gain in the convergence rate due to higher-order smoothness.
These estimates highlight two important aspects:
- 1.
The general rate (89) depends explicitly on the balance between the smoothness m and the chosen parameter β, giving a controlled decay of the approximation error.
- 2.
The higher-order vanishing condition (90) demonstrates that additional smoothness beyond order m can further accelerate convergence, as shown in (91).
5. Second-Order Multivariate Voronovskaya Expansions and Estimates
We refine the previous results by including second-order multivariate terms and deriving convergence estimates in L^p-norms. This allows a precise comparison among the three neural network operators.
Theorem 11
(Second-Order Multivariate Voronovskaya Expansion).
Let f be sufficiently smooth with bounded derivatives up to the required order, and let x ∈ ℝ^N. For sufficiently large n and 0 < β < 1, the following expansion holds for the Kantorovich-type operator:
where the remainder satisfies the uniform and L^p estimate:
The same expansion holds for quasi-interpolation operators with corresponding weights.
Proof. We extend the path-function technique: for t ∈ [0, 1], define
with derivatives
Applying the multivariate Taylor expansion with integral remainder gives
Decomposing the second-order terms explicitly for cross derivatives yields
For a uniform grid with mesh size 1/n, the remainder satisfies
Finally, summing over the grid points with Kantorovich or quasi-interpolation weights, we obtain the uniform and L^p bound
which completes the proof. □
Corollary 3
(Convergence Rate in the L^p Norm).
Under the hypotheses of Theorem 11, for 1 ≤ p ≤ ∞, the following convergence rates hold:
Moreover, if all derivatives vanish up to order m, i.e., every partial derivative of order between 1 and m vanishes at x, the remainder dominates and we obtain the higher-order estimate:
highlighting the gain due to additional smoothness of f.
Remark 8
(Comparison of Operators). The asymptotic expansions show that the quasi-interpolation, Kantorovich-type, and quadrature-type operators share the same principal term up to order m, with differences manifesting only in the remainder term. Kantorovich-type operators typically provide better L^p-stability due to integration over the cells, whereas quasi-interpolation operators preserve pointwise accuracy and allow explicit control over cross-derivative contributions.
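A rough numerical comparison in the spirit of Remark 8 measures discrete sup- and mean-absolute errors of the quasi-interpolation and Kantorovich-type sketches over a sample grid; these discrete norms merely stand in for the L^p norms of the analysis, and every parameter choice is illustrative.

```python
import numpy as np

def g(x, q, lam):
    return np.tanh(lam * x - 0.5 * np.log(q))   # overflow-safe assumed deformed tanh

def phi(x, q=2.0, lam=1.5):
    psi = lambda t, qq: 0.25 * (g(t + 1.0, qq, lam) - g(t - 1.0, qq, lam))
    return 0.5 * (psi(x, q) + psi(x, 1.0 / q))

def quasi(f, x, n, K=40):
    k = np.arange(int(n * x) - K, int(n * x) + K + 1)
    return np.sum(f(k / n) * phi(n * x - k))

def kantorovich(f, x, n, K=40, m=32):
    k = np.arange(int(n * x) - K, int(n * x) + K + 1)
    t = (np.arange(m) + 0.5) / (m * n)
    avg = np.array([np.mean(f(kj / n + t)) for kj in k])
    return np.sum(avg * phi(n * x - k))

f, n = np.sin, 200
xs = np.linspace(0.0, 1.0, 101)
errA = np.array([quasi(f, x, n) - f(x) for x in xs])
errK = np.array([kantorovich(f, x, n) - f(x) for x in xs])
print("sup-norm errors :", np.abs(errA).max(), np.abs(errK).max())
print("mean-abs errors :", np.abs(errA).mean(), np.abs(errK).mean())
```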
Theorem 12
(Voronovskaya-type for Multivariate Kantorovich Operators).
Let f ∈ C^m(ℝ^N) and let the multivariate Kantorovich operator be as defined above. Then, for each x ∈ ℝ^N, we have
as n → ∞, for any 0 < β < 1, where α ranges over multi-indices with |α| ≤ m and f_α denotes the partial derivative of order α.
Proof. Using the multivariate Taylor expansion with integral remainder, for f ∈ C^m(ℝ^N) and a grid point k/n, we write
with the remainder
Using the supremum norm of the m-th derivatives, we obtain the estimate
Integrating over the cell and summing over k, we define
which represents the total remainder contribution of the operator. By the above estimate, we have
implying
and for any 0 < β < 1,
The result follows by splitting the operator into the main polynomial part and the remainder. □
Theorem 13
(Voronovskaya-type Expansion for Quadrature Operators).
Let the multivariate quadrature-type operator be as defined above, and let f ∈ C^m(ℝ^N). Then, for each x ∈ ℝ^N,
as n → ∞, for any 0 < β < 1.
Proof. We proceed in a series of detailed steps:
For a fixed quadrature node, we expand f around x:
where the remainder admits the integral form
Using the uniform boundedness of the derivatives of order m, we get
The operator involves weighted sums over the quadrature points indexed by r, with the corresponding weights. Thus, for each k, we have
Next, summing over k with the kernel Z, which decays sufficiently fast, yields
Using the decay property of Z and standard estimates for lattice sums, we get
The parameter β reflects the trade-off between kernel truncation error and Taylor remainder. Choosing it appropriately ensures that the remainder is of higher order, providing an optimal convergence rate for the remainder.
Combining the main term from the Taylor expansion with the remainder estimate gives
which establishes the claimed Voronovskaya-type expansion. □
6. Voronovskaya-Type Expansion in Sobolev Spaces
Theorem 14
(Voronovskaya Expansion in Sobolev Spaces).
Let f be sufficiently smooth, n ∈ ℕ, and 0 < β < 1. Assume that the moments
exist for all multi-indices α. Then, for the quasi-interpolation operator,
where the leading coefficients and the remainder term are given by
Proof. For each lattice point, the multivariate Taylor expansion with integral remainder gives
where the remainder term admits the integral representation
By linearity of the operator and the definition
we have
Using the definition of the moments and the scaling properties of Z, we obtain
recovering the main term of the expansion:
The remainder can be expressed as a discrete convolution:
Differentiating under the summation (allowed since Z is smooth and rapidly decaying) and applying Minkowski's inequality, we get
for all multi-indices of admissible order.
Passing to the Riemann sum and applying Young's inequality for discrete convolutions yields
Hence, the remainder satisfies (123), and combining (129) with (130) completes the proof. □
7. Quantitative Rate with Explicit Constants
Theorem 15
(Quantitative Approximation Rate).
Let f be as above, and assume the kernel moments
exist and are finite for all multi-indices α. Then there exists an explicit constant such that
Proof. For each lattice point, the multivariate Taylor expansion with integral remainder yields
with the remainder
By linearity of the operator,
where
Since Z is rapidly decaying, the discrete sums are well defined and the tail contributions are negligible.
We change variables, writing u for the rescaled argument, so that
Taking norms and applying Minkowski's inequality:
The sum over u approximates the integral
Hence,
with C given explicitly by (134).
Combining the estimates for the main term and the remainder, we obtain the desired quantitative rate (135). □
8. Unified Voronovskaya-Type Expansion in Sobolev Spaces with Explicit Constants
Theorem 16
(Voronovskaya-Type Expansion in Sobolev Spaces with Explicit Constants).
Let f be sufficiently smooth, let n ∈ ℕ and 0 < β < 1, and let x ∈ ℝ^N. Assume the kernel Z has finite moments
and is rapidly decaying. Then for the quasi-interpolation operator we have
with remainder satisfying the explicit quantitative bound
where the constant is given explicitly by
Proof. For each lattice point, consider the multivariate Taylor expansion around x:
with integral remainder
By linearity of the operator, we obtain
where the correction term accounts for the approximation error in replacing discrete sums by the exact moments. Rapid decay of Z ensures that this correction is negligible.
Performing the appropriate change of variables, we then have
Applying the discrete Minkowski inequality and standard Sobolev shift estimates, we deduce
with the constant given explicitly in (147).
Combining all these steps, the Voronovskaya-type expansion in Sobolev space with an explicit quantitative rate is established. □
9. Uniform Stability of Voronovskaya-Type Expansions under Variations
Definition 3
(Uniform Kernel with Controlled Moments).
Let U be compact. We say that the family of kernels is uniform with controlled moments if
Theorem 17
(Uniform Stability).
Let the kernel family be uniform with controlled moments on a compact set U, and let f be sufficiently smooth. Denote by the corresponding operators those constructed from these kernels. Then the Voronovskaya-type expansions
are uniform in the parameter ranging over U, in the sense that
with constants in the estimates depending only on the suprema in (153).
Proof. For each parameter value and lattice point, consider the multivariate Taylor expansion of f around x:
where the last term is the integral remainder.
By linearity of the operator, we write
where the correction term accounts for the discrete-to-continuum moment approximation error.
By assumption (153), the relevant integrals are uniformly bounded over U. Therefore, each term in the remainder estimate satisfies
The uniform integrability of the kernel family allows the application of the dominated convergence theorem as n → ∞, yielding the uniform bound (155) and establishing the uniformity of the Voronovskaya expansions over U. □
10. Unified Voronovskaya-Type Expansion with Explicit Constants and Uniform Stability
10.1. Setup
Let f be sufficiently smooth, n ∈ ℕ, 0 < β < 1, and x ∈ ℝ^N. Let Z (or a parametric family of kernels) have the moments
and assume rapid decay and, for parametric families, uniform boundedness:
Define the multivariate quasi-interpolation operators
10.2. Unified Voronovskaya-Type Theorem
Theorem 18
(Compact Voronovskaya Expansion with Explicit Constants).
Under the above assumptions, the operator satisfies
where the remainder satisfies the uniform quantitative estimate
For parametric families, the expansion is uniform:
Moreover, this yields an explicit rate
Proof. Expand f using the multivariate Taylor formula of order m with integral remainder:
with
Applying the operator gives (163) with
For the remainder, differentiate under the sum:
By Young's inequality and the uniform boundedness of the kernel moments, this yields (164). For parametric families, dominated convergence and the uniform bounds in (161) imply uniformity over U. □
10.3. Remarks
This theorem simultaneously captures the classical Voronovskaya expansion, quantitative rates, Sobolev estimates, and uniform stability under parametric families.
Constants are fully explicit, depending only on kernel moments and derivatives.
The framework is directly applicable to numerical analysis, multivariate approximation, and theoretical studies of Sobolev-space operators.
We develop a rigorous framework for adaptive Voronovskaya-type expansions for multivariate neural network operators with dynamically deformed hyperbolic tangent activations. This extends classical asymptotic expansions to non-stationary kernels, where the deformation parameters adapt with the approximation scale. Our results establish accelerated convergence rates in Sobolev spaces while ensuring uniform control of derivatives up to order s. Precise remainder estimates are provided, demonstrating the interplay between kernel deformation and Sobolev regularity, and enabling applications in high-dimensional function approximation and adaptive deep learning architectures.
11. Setup and Hypotheses for Sobolev-Santos Uniform Adaptive Convergence Theorem
Definition 4
(Adaptive Quasi-Interpolation Operator).
Let f : ℝ^N → ℝ be a given function. Define the adaptive operator
where the multivariate adaptive kernel is
The deformation and scaling parameters are adaptive in the scale n, satisfying
Hypotheses.
- 1.
Controlled Deformation: There exist constants bounding the adaptive deformation and scaling parameters away from degenerate values, uniformly for all n (a hypothetical instance is sketched after these hypotheses).
- 2.
Uniform Exponential Decay: For all multi-indices α with |α| ≤ s, there exists a constant such that
- 3.
Uniformly Bounded Moments: For all n,
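To make the adaptive setting concrete, the sketch below instantiates one hypothetical choice of scale-dependent parameters satisfying the controlled-deformation hypothesis and evaluates the corresponding adaptive quasi-interpolation operator. The schedules q_n = 1 + 1/n and λ_n = 1 + 1/log(n + 1), as well as the kernel form, are invented for illustration and are not prescribed by the text.

```python
import numpy as np

def g(x, q, lam):
    return np.tanh(lam * x - 0.5 * np.log(q))   # overflow-safe assumed deformed tanh

def phi(x, q, lam):
    psi = lambda t, qq: 0.25 * (g(t + 1.0, qq, lam) - g(t - 1.0, qq, lam))
    return 0.5 * (psi(x, q) + psi(x, 1.0 / q))

def adaptive_params(n):
    """Hypothetical schedules, bounded away from degenerate values for all n."""
    return 1.0 + 1.0 / n, 1.0 + 1.0 / np.log(n + 1.0)

def adaptive_quasi(f, x, n, K=60):
    q_n, lam_n = adaptive_params(n)
    k = np.arange(int(n * x) - K, int(n * x) + K + 1)
    return np.sum(f(k / n) * phi(n * x - k, q_n, lam_n))

f, x0 = np.sin, 0.4
for n in (10, 100, 1000):
    print(n, adaptive_params(n), adaptive_quasi(f, x0, n) - f(x0))
```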
Theorem 19
(Sobolev-Santos Uniform Adaptive Convergence).
Let f belong to the Sobolev space W^{s,p}(ℝ^N), with s ∈ ℕ and 1 ≤ p ≤ ∞. Assume the adaptive quasi-interpolation operator satisfies the hypotheses of controlled deformation, uniform exponential decay, and bounded moments. Then, for sufficiently large n, the following adaptive Voronovskaya expansion holds:
where the multivariate moments are
and the remainder term satisfies the explicit estimate
with a smooth, bounded function quantifying the deviation from stationarity:
Moreover, the expansion is uniform in the Sobolev norm up to order s, i.e.,
- 1.
The deviation factor explicitly captures the influence of adaptive kernel deformation on the remainder, providing a quantitative measure of non-stationarity.
- 2.
If the deformation and scaling parameters are held fixed (independent of n), the theorem recovers the classical stationary Voronovskaya expansion.
- 3.
The theorem can be extended to fractional Sobolev spaces with non-integer s, allowing finer control for functions with limited smoothness.
- 4.
Uniformity in s ensures stability of derivatives up to order s, which is crucial for high-dimensional deep learning applications where gradients are propagated through multiple layers.
Proof. Let x ∈ ℝ^N and let n be sufficiently large. The proof proceeds in several steps.
For any lattice point, by Taylor's theorem with integral remainder, we have
where the remainder is explicitly
Substituting the grid points in (171) yields
By definition of the multivariate moments and using the discrete-to-continuum approximation via the kernel's exponential decay, we have
Hence, the main contribution in (174) is
Define the remainder operator
Applying Minkowski's inequality and using the uniform exponential decay and bounded moments of the adaptive kernel, we obtain
where the constant is independent of n and the exponent is the decay rate arising from the kernel's scaling.
The smoothness of the adaptive kernel guarantees that differentiation commutes with the operator up to order s, so for all multi-indices α with |α| ≤ s,
This completes the proof. □
12. Results
This study establishes rigorous theoretical results for symmetrized hyperbolic tangent neural network operators, with a focus on the novel Sobolev-Santos Uniform Convergence Theorem. The main contributions are as follows:
- 1.
Voronovskaya-Type Expansions for Basic Operators: For functions f ∈ C^m(ℝ^N), the approximation error of the basic quasi-interpolation operator admits an asymptotic expansion:
This expansion provides explicit convergence rates, dependent on the smoothness m of f and the grid parameter n.
- 2.
Refined Expansions for Kantorovich and Quadrature Operators: For sufficiently smooth f, the Kantorovich operator satisfies a refined expansion:
where the remainder is explicitly bounded, demonstrating higher-order accuracy.
- 3.
Sobolev Space Estimates: The approximation error in the Sobolev space W^{s,p} is bounded by:
This result provides quantitative estimates for the convergence rate, with explicit constants derived from the moments of the kernel function.
- 4.
Sobolev-Santos Uniform Convergence Theorem: The Sobolev-Santos Theorem (Theorem 19) establishes that for adaptive quasi-interpolation operators, the following expansion holds:
where the remainder satisfies the explicit estimate:
The associated function quantifies the deviation from stationarity, ensuring uniform stability under parametric variations of the activation function. This theorem is pivotal for applications requiring adaptive kernel deformation, such as high-dimensional deep learning architectures.
- 5.
Uniform Stability Under Parametric Variations: The expansions remain uniformly valid even when the activation function parameters vary, ensuring robustness in practical applications. This stability is critical for adaptive neural network architectures, where parameters may dynamically adjust during training or optimization.
13. Conclusions
This work advances the theory of neural network approximation by introducing symmetrized hyperbolic tangent-based operators and deriving Voronovskaya-type asymptotic expansions for their multivariate counterparts. The Sobolev-Santos Uniform Convergence Theorem is a cornerstone of this study, providing a rigorous framework for adaptive quasi-interpolation operators with dynamically deformed activation functions. This theorem ensures that the operators can approximate smooth functions and their derivatives with high accuracy, even as the deformation parameters evolve, while maintaining uniform control over the convergence rates in Sobolev spaces.
The explicit constants and uniform bounds derived in this study offer a solid foundation for both theoretical and applied research in neural network-based function approximation. The results highlight the superior performance of these operators in high-dimensional approximation problems, with direct implications for artificial intelligence, numerical analysis, and data-driven modeling. The uniform stability under parametric variations further enhances their applicability in adaptive deep learning architectures, where robustness and flexibility are essential.
Future research directions include exploring adaptive grid strategies, extending the framework to fractional Sobolev spaces, and generalizing the results to non-Euclidean domains. These advancements could further expand the applicability of symmetrized hyperbolic tangent neural networks in modern computational frameworks, particularly in scenarios requiring high-dimensional function approximation and adaptive learning.
Acknowledgments
Santos gratefully acknowledges the support of the PPGMC Program for the Postdoctoral Scholarship PROBOL/UESC nr. 218/2025. Sales would like to express his gratitude to CNPq for the financial support under grant 30881/2025-0.
References
- Anastassiou, G. A. (1997). Rate of convergence of some neural network operators to the unit-univariate case. Journal of Mathematical Analysis and Applications, 212(1), 237-262. [CrossRef]
- Anastassiou, G. (2000). Quantitative approximations. Chapman and Hall/CRC. [CrossRef]
- Anastassiou, G. A. (2016). Intelligent Systems II: Complete Approximation by Neural Network Operators (Vol. 608). Cham: Springer International Publishing. [CrossRef]
- Anastassiou, G. A. (2023). Parametrized, deformed and general neural networks. Berlin/Heidelberg, Germany: Springer. [CrossRef]
- Chen, Z., & Cao, F. (2009). The approximation operators with sigmoidal functions. Computers and Mathematics with Applications, 58, 758-765. [CrossRef]
- Haykin, S. (1994). Neural networks: a comprehensive foundation. Prentice hall PTR.
- McCulloch, W. S., & Pitts, W. (1943). A logical calculus of the ideas immanent in nervous activity. The bulletin of mathematical biophysics, 5(4), 115-133. [CrossRef]
- Yu, D., & Cao, F. (2025). Construction and approximation rate for feedforward neural network operators with sigmoidal functions. Journal of Computational and Applied Mathematics, 453, 116150. [CrossRef]
- Yoo, J., Kim, J., Gim, M., & Lee, H. (2024). Error estimates of physics-informed neural networks for initial value problems. Journal of the Korean Society for Industrial and Applied Mathematics, 28(1), 33-58.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).