In this article, we investigate how the performance of Massive MIMO cellular systems can be enhanced by introducing D2D communication. We consider a scenario where the base station (BS) is equipped with large, but finite number of antennas and the total number of UEs is kept fixed. The key design question is that what fraction of users should be offloaded to D2D mode in order to maximize the aggregate cell level throughput. We demonstrate that there exists an optimal user offload fraction, which maximizes the overall capacity. This fraction is strongly coupled with the network parameters such as the number of antennas at the BS, D2D link distance and the transmit SNR at both the UE and the BS and careful tuning of the offload fraction can provide up to 5× capacity gains.