We study the conditional distribution of low-dimensional projections from
high-dimensional data, where the conditioning is on other low-dimensional
projections. To fix ideas, consider a random d-vector Z that has a Lebesgue
density and that is standardized so that EZ=0 and
EZZ′=Id. Moreover, consider two projections defined by unit-vectors
α and β, namely a response y=α′Z and an explanatory
variable x=β′Z. It has long been known that the conditional mean of y
given x is approximately linear in xundersomeregularityconditions;cf.HallandLi[Ann.Statist.21(1993)867−889].However,acorrespondingresultfortheconditionalvariancehasnotbeenavailablesofar.Wehereshowthattheconditionalvarianceofygivenxisapproximatelyconstantinx(again,undersomeregularityconditions).Theseresultsholduniformlyin\alphaandformost\beta′s,providedonlythatthedimensionofZislarge.Inthatsense,weseethatmostlinearsubmodelsofahigh−dimensionaloverallmodelareapproximatelycorrect.Ourfindingsprovidenewinsightsinavarietyofmodelingscenarios.Wediscussseveralexamples,includingslicedinverseregression,slicedaveragevarianceestimation,generalizedlinearmodelsunderpotentiallinkviolation,andsparselinearmodeling.