In its Uncooked frequency sort, tf is simply the frequency on the "this" for every document. In Each and every document, the phrase "this" appears the moment; but given that the document 2 has far more words and phrases, its relative frequency is scaled-down.epoch. Because of this a Dataset.batch applied following Dataset.repeat will yield batches