1. A Necessary and Sufficient Condition That Ordinary Least-Squares Estimators Be Best Linear Unbiased
2. The assumptions about X'X and plim ((1/n)X'X) are actually necessary for identification, without which no estimator can be unbiased or consistent Wouldn't It Be Nice … ? The Automatic Unbiasedness of OLS (and GLS)
3. The first (K + 1) × 1 vector is n times the second; the assumption is that they are both zero.
4. Of course, even assuming E(X'u) = 0, X'u ≠ 0. That is, the sample (as opposed to the population) mean of the disturbance (as opposed to the residual) and its sample (as opposed to population) covariances with the regressors will not be zero. If the assumption is correct (and the sample not tiny), they will generally be close but will never be zero exactly (assuming a continuous disturbance).
5. Given that the sample mean of û is 0, is n times the sample covariance between xk and û.