This paper studies extreme values in infinite moving average processes $X_t = \sum_\lambda c_{\lambda - t} Z_\lambda$ defined from an i.i.d. noise sequence $\{Z_\lambda\}$. In particular this includes the ARMA-processes often used in time series analysis. A fairly complete extremal theory is developed for the cases when the d.f. of the $Z_\lambda$'s has a smooth tail which decreases approximately as $\exp\{- z^p\}$ as $z \rightarrow \infty$, for $0 < p < \infty$, or as a power of $z$. The influence of the averaging on extreme values depends on $p$ and the $c_\lambda$'s in a rather intricate way. For $p = 2$, which includes normal sequences, the correlation function $r_t = \sum_\lambda c_{\lambda - t}c_\lambda/\sum_\lambda c^2_\lambda$ determines extremal behavior while, perhaps more surprisingly, for $p \neq 2$ correlations have little bearing on extremes. Further, the sample paths of $\{X_t\}$ near extreme values asymptotically assume a specific nonrandom form, which again depends on $p$ and $\{c_\lambda\}$ in an interesting way. One use of this latter result is as an informal quantitative check of a fitted moving average (or ARMA) model, by comparing the sample path behavior predicted by the model with the observed sample paths.