This paper considers the downlink of a heterogeneous network in which multiple base stations (BSs) can serve each user by non-coherent multiflow beamforming. We assume imperfect channel state information at both the BSs and the users. The objective is to jointly optimize the precoding, load balancing, and BS operation mode (active or sleep) to improve the energy efficiency of the network. Specifically, we minimize the weighted total power consumption (both circuit power and dynamic transmit power) subject to per-user quality of service constraints and per-BS transmit power constraints. This problem is non-convex, but we prove that for each combination of BS modes it has a hidden convexity structure. Thus, the globally optimal solution can be obtained by an exhaustive search over all possible BS mode combinations. Furthermore, by iterative convex approximations of the non-convex power consumption functions, we propose a low-complexity heuristic algorithm that obtains a locally optimal solution. Simulation results illustrate that the proposed algorithms significantly reduce the total power consumption compared to the scheme in which all BSs are continuously active. This implies that putting BSs into sleep mode, supported by proper load balancing, is an important means of saving energy in heterogeneous networks.
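The exhaustive search over BS mode combinations can be illustrated with a minimal sketch. It is not the algorithm of this paper: the function `toy_solver` is a hypothetical placeholder for the convex subproblem solved for each fixed mode combination, and the names `active_power`, `sleep_power`, `capacity`, and `cost`, as well as all numerical values, are assumptions made only for illustration. The power weights are also omitted for brevity.

```python
from itertools import product


def exhaustive_mode_search(num_bs, solve_fixed_modes, active_power, sleep_power):
    """Try all 2**num_bs mode combinations and keep the one with least total power."""
    best_power, best_modes = float("inf"), None
    for modes in product((0, 1), repeat=num_bs):  # 1 = active, 0 = sleep
        # Placeholder for the per-combination convex problem.
        # A return value of None means the QoS constraints cannot be met.
        transmit = solve_fixed_modes(modes)
        if transmit is None:
            continue
        circuit = sum(
            active_power[k] if m else sleep_power[k] for k, m in enumerate(modes)
        )
        total = circuit + transmit
        if total < best_power:
            best_power, best_modes = total, modes
    return best_power, best_modes


def toy_solver(modes, demand=3.0, capacity=(4.0, 2.0, 2.0), cost=(1.0, 0.5, 0.5)):
    """Hypothetical stand-in: serve the demand from the cheapest active BSs first."""
    remaining, power = demand, 0.0
    for k in sorted(range(len(modes)), key=lambda k: cost[k]):
        if modes[k]:
            served = min(remaining, capacity[k])
            power += served * cost[k]
            remaining -= served
    return power if remaining <= 1e-12 else None


# One macro BS (index 0) and two small BSs (indices 1 and 2); assumed values.
result = exhaustive_mode_search(
    num_bs=3,
    solve_fixed_modes=toy_solver,
    active_power=(10.0, 2.0, 2.0),
    sleep_power=(1.0, 0.5, 0.5),
)
print(result)  # (6.5, (0, 1, 1)): the macro BS sleeps, both small BSs are active
```

The number of combinations grows as 2^K for K BSs, which is why a lower-complexity heuristic is of interest for larger networks.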