<!DOCTYPE article
PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD with MathML3 v1.2 20190208//EN" "JATS-archivearticle1-mathml3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" article-type="research-article"><?properties manuscript?><front><journal-meta><journal-id journal-id-type="nlm-journal-id">101529983</journal-id><journal-id journal-id-type="pubmed-jr-id">38510</journal-id><journal-id journal-id-type="nlm-ta">Test (Madr)</journal-id><journal-id journal-id-type="iso-abbrev">Test (Madr)</journal-id><journal-title-group><journal-title>Test (Madrid, Spain)</journal-title></journal-title-group><issn pub-type="ppub">1133-0686</issn><issn pub-type="epub">1863-8260</issn></journal-meta><article-meta><article-id pub-id-type="pmid">33281439</article-id><article-id pub-id-type="pmc">7717567</article-id><article-id pub-id-type="doi">10.1007/s11749-018-0621-3</article-id><article-id pub-id-type="manuscript">NIHMS1524638</article-id><article-categories><subj-group subj-group-type="heading"><subject>Article</subject></subj-group></article-categories><title-group><article-title>Comments on: Process modeling for slope and aspect with application to elevation data maps</article-title></title-group><contrib-group><contrib contrib-type="author"><name><surname>Banerjee</surname><given-names>Sudipto</given-names></name></contrib><aff id="A1">Department of Biostatistics, University of California, Los Angeles, CA 90095-1772. 
USA Tel.: +1-310-825-5916, Fax: +1-310-267-2113, <email>sudipto@ucla.edu</email></aff></contrib-group><pub-date pub-type="nihms-submitted"><day>28</day><month>3</month><year>2019</year></pub-date><pub-date pub-type="epub"><day>12</day><month>11</month><year>2018</year></pub-date><pub-date pub-type="ppub"><month>12</month><year>2018</year></pub-date><pub-date pub-type="pmc-release"><day>04</day><month>12</month><year>2020</year></pub-date><volume>27</volume><issue>4</issue><fpage>773</fpage><lpage>775</lpage><!--elocation-id from pubmed: 10.1007/s11749-018-0621-3--><self-uri xlink:href="https://link.springer.com/article/10.1007%2Fs11749-018-0621-3"/></article-meta></front><body><p id="P1">It is a pleasure to comment on this interesting article by Wang, Bhattacharya and Gelfand. The article discusses formal Bayesian inference on topographic features such as slope and aspect using information from GIS. The authors have nicely exploited and extended some of the previously established results on spatial gradients to infer on topographic slopes and aspects. Bayesian inference, and sampling-based computation of the posterior predictive distribution, is very convenient here because the topographic functions of interest are fairly simple functions of directional spatial gradients. Therefore, posterior samples for slopes and aspects are immediately obtained from the posterior samples of the directional gradients.</p><p id="P2">The key underlying feature of such modeling is that finite difference increments of stationary Gaussian processes are again Gaussian processes and, hence, so are their limits as the increments become negligibly small (see, e.g., <xref rid="R4" ref-type="bibr">Parzen, 1962</xref>). In fact, as the authors have correctly noted, since the smoothness of the process realizations is determined by the stationary covariance function, one only needs to specify an appropriate covariance function to ensure the existence of the gradient process. 
This leads to an elegant distribution theory for spatial gradients that can be embedded within hierarchical modeling contexts to carry out Bayesian inference for directional rates of change (e.g., <xref rid="R2" ref-type="bibr">Banerjee, Gelfand and Sirmans, 2003</xref>) or even spatiotemporal gradients (<xref rid="R5" ref-type="bibr">Quick, Banerjee and Carlin, 2015</xref>).</p><p id="P3">This framework can, in fact, be generalized to infer on sufficiently smooth functionals of the process. Indeed, with an appropriate specification for a spatial process <italic>Y</italic>(<italic>s</italic>), we can derive the multivariate process <inline-formula><mml:math display="inline" id="M1" overflow="scroll"><mml:mrow><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:mi>Y</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>s</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>,</mml:mo><mml:mi mathvariant="script">L</mml:mi><mml:mi>Y</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>s</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>}</mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula>, where <inline-formula><mml:math display="inline" id="M2" overflow="scroll"><mml:mrow><mml:mi mathvariant="script">L</mml:mi><mml:mi>Y</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>s</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></inline-formula> is a linear functional of <italic>Y</italic>(<italic>s</italic>), and carry out inference based on the posterior predictive distribution <inline-formula><mml:math display="inline" id="M3" overflow="scroll"><mml:mrow><mml:mo stretchy="false">[</mml:mo><mml:mi mathvariant="script">L</mml:mi><mml:mi>Y</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>s</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo stretchy="false">|</mml:mo><mml:mi>Y</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>S</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo stretchy="false">]</mml:mo></mml:mrow></mml:math></inline-formula>, where <italic>Y</italic>(S) are 
observations of <italic>Y</italic>(<italic>s</italic>) over a finite set of locations S. One example is the estimation of gradients along curves to derive Bayesian detection rules for so-called curves of rapid change or &#x0201c;wombling boundaries&#x0201d; (<xref rid="R1" ref-type="bibr">Banerjee and Gelfand, 2006</xref>). In fact, estimating maximal gradient processes, which are central to the current paper, is a key part of &#x0201c;wombling&#x0201d; or boundary detection problems (see, e.g., Figure 2c in <xref rid="R1" ref-type="bibr">Banerjee and Gelfand, 2006</xref>). Since the joint process <inline-formula><mml:math display="inline" id="M4" overflow="scroll"><mml:mrow><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:mi>Y</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>s</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>,</mml:mo><mml:mi mathvariant="script">L</mml:mi><mml:mi>Y</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>s</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>}</mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula> is a well-defined stochastic process over the entire domain, one can predict either <italic>Y</italic>(<italic>s</italic>) or <inline-formula><mml:math display="inline" id="M5" overflow="scroll"><mml:mrow><mml:mi mathvariant="script">L</mml:mi><mml:mi>Y</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>s</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></inline-formula> at arbitrary locations, including locations where neither process has been observed. 
In full generality, we can compute the predictive densities <inline-formula><mml:math display="inline" id="M6" overflow="scroll"><mml:mrow><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mi>Y</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msub><mml:mi>U</mml:mi><mml:mn>1</mml:mn></mml:msub></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:mi mathvariant="script">L</mml:mi><mml:mi>Y</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msub><mml:mi>U</mml:mi><mml:mn>2</mml:mn></mml:msub></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mo stretchy="false">|</mml:mo><mml:mi>Y</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mn>1</mml:mn></mml:msub></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:mi mathvariant="script">L</mml:mi><mml:mi>Y</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mn>2</mml:mn></mml:msub></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula>, where S<sub>1</sub> and S<sub>2</sub> are sets of locations yielding observations on <italic>Y</italic>(<italic>s</italic>) and <inline-formula><mml:math display="inline" id="M7" overflow="scroll"><mml:mrow><mml:mi mathvariant="script">L</mml:mi><mml:mi>Y</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>s</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></inline-formula>, respectively, and <italic>U</italic><sub>1</sub> and <italic>U</italic><sub>2</sub> are sets of locations where predictions are sought for <italic>Y</italic>(<italic>s</italic>) and <inline-formula><mml:math display="inline" id="M8" overflow="scroll"><mml:mrow><mml:mi mathvariant="script">L</mml:mi><mml:mi>Y</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>s</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></inline-formula>, respectively.</p><p id="P4">And while we are at it, why not extend even further to quantities of interest in vector analysis or differential 
geometry? For example, we can express the process in terms of the random position vector <italic>r</italic>(<italic>u</italic>, <italic>v</italic>) = <italic>(u,v,Y</italic>(<italic>s</italic>)), where <italic>s</italic> = (<italic>u</italic>,<italic>v</italic>), and infer on the basis vectors <italic>r</italic><sub><italic>u</italic></sub>(<italic>s</italic>) = &#x02202;<italic>r</italic>(<italic>u</italic>, <italic>v</italic>)/&#x02202;<italic>u</italic> and <italic>r</italic><sub><italic>v</italic></sub>(<italic>s</italic>) = &#x02202;<italic>r</italic>(<italic>u</italic>, <italic>v</italic>)/&#x02202;<italic>v</italic>, which span the so-called tangent plane. Subsequently, we can compute the &#x0201c;first fundamental form&#x0201d; (<italic>E</italic>(<italic>s</italic>),<italic>F</italic>(<italic>s</italic>),<italic>G</italic>(<italic>s</italic>)), defined as
<disp-formula id="FD1"><mml:math display="block" id="M9" overflow="scroll"><mml:mrow><mml:mi>E</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>s</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mo>&#x02329;</mml:mo><mml:msub><mml:mi>r</mml:mi><mml:mi>u</mml:mi></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>s</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>r</mml:mi><mml:mi>u</mml:mi></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>s</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>&#x0232a;</mml:mo><mml:mo>;</mml:mo><mml:mi>F</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>s</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mo>&#x02329;</mml:mo><mml:msub><mml:mi>r</mml:mi><mml:mi>u</mml:mi></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>s</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>r</mml:mi><mml:mi>v</mml:mi></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>s</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>&#x0232a;</mml:mo><mml:mo>;</mml:mo><mml:mi>G</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>s</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mo>&#x02329;</mml:mo><mml:msub><mml:mi>r</mml:mi><mml:mi>v</mml:mi></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>s</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>r</mml:mi><mml:mi>v</mml:mi></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>s</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>&#x0232a;</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
where &#x02329;&#x000b7;, &#x000b7;&#x0232a; denotes an appropriate inner product. The first fundamental form completely determines several quantities of interest including curls, surface areas and arc-lengths. For instance, the differential element of surface area, say <italic>dA</italic>(<italic>s</italic>), is approximated by the area of the local patch on the tangent plane to the surface at <italic>s</italic> and is given by the norm of the cross product of the basis vectors as
<disp-formula id="FD2"><mml:math display="block" id="M10" overflow="scroll"><mml:mrow><mml:mi>d</mml:mi><mml:mi>A</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>s</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mo>&#x02016;</mml:mo><mml:mo>&#x02202;</mml:mo><mml:mi>r</mml:mi><mml:mo>/</mml:mo><mml:mo>&#x02202;</mml:mo><mml:mi>u</mml:mi><mml:mo>&#x000d7;</mml:mo><mml:mo>&#x02202;</mml:mo><mml:mi>r</mml:mi><mml:mo>/</mml:mo><mml:mo>&#x02202;</mml:mo><mml:mi>v</mml:mi><mml:mo>&#x02016;</mml:mo><mml:mi>d</mml:mi><mml:mi>s</mml:mi><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula></p><p id="P5">By Lagrange&#x02019;s identity, this can be shown to be equivalent to <italic>dA</italic>(<italic>s</italic>) = {<italic>E</italic>(<italic>s</italic>)<italic>G</italic>(<italic>s</italic>) &#x02212; <italic>F</italic>(<italic>s</italic>)<sup>2</sup>}<sup>1/2</sup> <italic>ds</italic>. The surface area of <italic>r</italic>(<italic>s</italic>) over a domain <italic>D</italic> is the continuous sum (integral) of the areas of these infinitesimal parallelograms on the surface and is defined as
<disp-formula id="FD3"><mml:math display="block" id="M11" overflow="scroll"><mml:mrow><mml:mi>A</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>D</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mo>&#x0222b;</mml:mo><mml:mi>d</mml:mi><mml:mi>A</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>u</mml:mi><mml:mo>,</mml:mo><mml:mi>v</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mstyle><mml:mrow><mml:msub><mml:mo>&#x0222b;</mml:mo><mml:mrow><mml:mi>s</mml:mi><mml:mo>&#x02208;</mml:mo><mml:mi>D</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:msqrt><mml:mrow><mml:mi>E</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>s</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mi>G</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>s</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>&#x02212;</mml:mo><mml:mi>F</mml:mi><mml:msup><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mi>s</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:msqrt></mml:mrow></mml:mrow></mml:mstyle><mml:mi>d</mml:mi><mml:mi>s</mml:mi><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula></p><p id="P6">The first fundamental form depends only upon the gradient (components of &#x02207;<italic>r</italic>), so statistical inference on the above quantities and their functions (such as <italic>A</italic>(<italic>D</italic>)) fits well within the framework provided in the paper. In fact, in the sampling-based framework one would only need to compute the gradients of <italic>r</italic>(<italic>s</italic>), which would immediately provide the posterior distributions of the components of the first fundamental form. 
Inference on physical quantities in classical vector analysis involving higher-order differentials, such as Laplacians, curvatures and divergences, can also be formulated using appropriate hyper-surface parametrizations.</p><p id="P7">As is apparent from the above, one could indulge oneself with the distribution theory available for linear functionals of Gaussian processes. But are such problems scientifically relevant? What types of applications would demand such inference? This is not yet clear to me. In spatial statistics, inferring on <inline-formula><mml:math display="inline" id="M12" overflow="scroll"><mml:mrow><mml:mi mathvariant="script">L</mml:mi><mml:mi>Y</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>s</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></inline-formula> given observations on the response seems to have been relevant for understanding zones and boundaries of rapid change. However, I have not seen many applications involving information on gradients or linear functionals. This is perhaps because reliable information on <inline-formula><mml:math display="inline" id="M13" overflow="scroll"><mml:mrow><mml:mi mathvariant="script">L</mml:mi><mml:mi>Y</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>s</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></inline-formula> is difficult to gather. We usually observe elevation, not gradients. In computer experiments, where Gaussian processes are widely used as response surface emulators for complex functional outputs, the computer program (or the physical system) can provide information on gradients. Such information can then be used as part of the data to interpolate the response surface (see, e.g., <xref rid="R3" ref-type="bibr">Morris, Mitchell and Ylvisaker, 1993</xref>). I think similar explorations of predictive performance when gradient information is available, perhaps from DEMs or DTMs, will be an interesting exercise to pursue. 
The authors state that they never observe the slope or aspect process. This is correct, but can DEMs provide such information? Will incorporating such information help enhance inferential performance?</p><p id="P8">I conclude with a few remarks specific to the paper&#x02019;s contribution. This is relevant since a casual reader may erroneously conclude that much of the paper rehashes established results on the theory of spatial gradients. The authors have derived two novel results, both of which play a central role in the inference of slope and aspect processes. The first establishes conditions for the independence between the slope and the aspect processes. The second outlines conditions for the aspect to have, a priori, a circular uniform distribution. The proofs are elegant and indeed the results help in building intuition behind these processes. I do note that the authors have focused on inference for the mean E[<italic>Y</italic> (<italic>s</italic>)] = <italic>&#x003bc;</italic>(<italic>s</italic>) + <italic>w</italic>(<italic>s</italic>), which will require the mean function <italic>&#x003bc;</italic>(<italic>s</italic>) to be smooth and admit gradients. This may not always be true: consider settings where <italic>&#x003bc;</italic>(<italic>s</italic>) = <italic>x</italic>(<italic>s</italic>)<sup>&#x022a4;</sup><italic>&#x003b2;</italic> and some of the explanatory variables are categorical. Inference for &#x02207;<italic>w</italic>(<italic>s</italic>) is still permissible. 
Finally, while slopes and aspects on the mean may be most relevant in purely spatial contexts, estimating slopes and aspects in spatiotemporal latent processes <italic>w</italic>(<italic>s</italic>, <italic>t</italic>) and testing for their dynamic behavior for the underlying process may still be useful in understanding local behavior of latent processes.</p></body><back><ref-list><title>References</title><ref id="R1"><mixed-citation publication-type="journal"><name><surname>Banerjee</surname><given-names>S</given-names></name> and <name><surname>Gelfand</surname><given-names>AE</given-names></name>
<article-title>Bayesian Wombling: Curvilinear gradient assessment under spatial process models</article-title>. <source>Journal of the American Statistical Association</source>, <volume>101</volume>, <fpage>1487</fpage>&#x02013;<lpage>1501</lpage> (<year>2006</year>).<pub-id pub-id-type="pmid">20221318</pub-id></mixed-citation></ref><ref id="R2"><mixed-citation publication-type="journal"><name><surname>Banerjee</surname><given-names>S</given-names></name>, <name><surname>Gelfand</surname><given-names>AE</given-names></name> and <name><surname>Sirmans</surname><given-names>CF</given-names></name>
<article-title>Directional rates of change under spatial process models</article-title>. <source>Journal of the American Statistical Association</source>,
<volume>98</volume>, <fpage>946</fpage>&#x02013;<lpage>954</lpage> (<year>2003</year>).</mixed-citation></ref><ref id="R3"><mixed-citation publication-type="journal"><name><surname>Morris</surname><given-names>MD</given-names></name>, <name><surname>Mitchell</surname><given-names>TJ</given-names></name> and <name><surname>Ylvisaker</surname><given-names>D</given-names></name>
<article-title>Bayesian design and analysis of computer experiments: Use of derivatives in surface prediction</article-title>. <source>Technometrics</source>, <volume>35</volume>, <fpage>243</fpage>&#x02013;<lpage>255</lpage> (<year>1993</year>).</mixed-citation></ref><ref id="R4"><mixed-citation publication-type="book"><name><surname>Parzen</surname><given-names>E</given-names></name>
<source>Stochastic Processes</source>. <publisher-name>Holden-Day</publisher-name>: <publisher-loc>San Francisco</publisher-loc> (<year>1962</year>).</mixed-citation></ref><ref id="R5"><mixed-citation publication-type="journal"><name><surname>Quick</surname><given-names>H</given-names></name>, <name><surname>Banerjee</surname><given-names>S</given-names></name> and <name><surname>Carlin</surname><given-names>BP</given-names></name>
<article-title>Bayesian modeling and analysis for gradients in spatiotemporal processes</article-title>. <source>Biometrics</source>, <volume>71</volume>, <fpage>575</fpage>&#x02013;<lpage>584</lpage> (<year>2015</year>).<pub-id pub-id-type="pmid">25898989</pub-id></mixed-citation></ref></ref-list></back></article>