目前訓練數(shù)據(jù)中,總會出現(xiàn)loss非常高的樣本,那就需要針對這類樣本進行更多的訓練,由于按照樣本長度已經(jīng)將樣本放在不同的bucket中,那就需要針對某個bucket進行更多采樣。
以下代碼為樣例:
import random
def rand_pick(seq, probabilities):
x = random.uniform(0, 1)
cumprob = 0.0
for item , item_pro in zip(seq, probabilities):
cumprob += item_pro
if x < cumprob:
break
return item
value_list = [0 , 1, 2]
probabilities = [0.4 , 0.3, 0.3]
for i in range(10):
print(rand_pick(value_list, probabilities))