Pitch detection based on neural-net classifiers is investigated. To this end, the extent of generalization attainable with neural nets is first examined, and it is shown that a suitable choice of features is required to utilize this property. Specifically, invariant features should be used whenever possible. For pitch detection, two feature sets, one based on waveform samples and the other based on properties of waveform peaks, are introduced. Experiments with neural classifiers demonstrate that the latter feature set, which has better invariance properties, performs more successfully. It is found that the best neural-net pitch tracker approaches the level of agreement of human labelers on the same data set, and performs competitively in comparison to a sophisticated feature-based tracker. An analysis of the errors committed by the neural net (relative to the hand labels used for training) reveals that they are mostly due to inconsistent hand labeling of ambiguous waveform peaks.