We consider the following basic, and very broad, statistical problem: Given a known high-dimensional distribution ${\cal D}$ over $\R^n$ and a collection of data points in $\R^n$, distinguish between the two possibilities that (i) the data was drawn from ${\cal D}$, versus (ii) the data was drawn from ${\cal D}|_S$, i.e. from ${\cal D}$ subject to truncation by an unknown truncation set $S \subseteq \R^n$.
We study this problem in the setting where ${\cal D}$ is a high-dimensional i.i.d.~product distribution and $S$ is an unknown degree-$d$ polynomial threshold function (one of the most well-studied types of Boolean-valued function over $\R^n$). Our main results are an efficient algorithm when ${\cal D}$ is a hypercontractive distribution, and a matching lower bound: