-
Notifications
You must be signed in to change notification settings - Fork 159
Open
Description
When item parsing ML-10M, for some cases, release year cannot be obtained.
origin_name.find('(') catches alternate name. I modified extended_dataset.py L.284 load_item_data.
m = re.compile(r'\((\d+)\)') for i in range(origin_data.shape[0]): split_type = origin_data.iloc[i, 2].split('|') type_str = ' '.join(split_type) processed_data.iloc[i, 2] = type_str origin_name = origin_data.iloc[i, 1] r = m.search(origin_name) if r: year = r.group(1) year_start = r.start()+1 #year_start = origin_name.find('(') + 1 #year_end = origin_name.find(')') title_end = year_start - 2 #year = origin_name[year_start:year_end] title = origin_name[0: title_end]
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels