Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DMM scraper cannot process DMM standard naming convention #339

Open
redraiderj opened this issue Oct 9, 2020 · 0 comments
Open

DMM scraper cannot process DMM standard naming convention #339

redraiderj opened this issue Oct 9, 2020 · 0 comments
Labels

Comments

@redraiderj
Copy link

DMM has its own naming convention for video files purchased and downloaded from their website. For example, instead of IPX-043, it could be ipx00043hhb or ipx00043mhb based on your bitrate selection.

Here is a list for filenames based on different bitrate:
300K: ipx00043sm
1000K: ipx00043dm
2000K: ipx00043mmb
3000K: ipx00043mhb
4000K: ipx00043hmb
6000K: ipx00043hhb

Also there are other video files which have prefix, for example h_286stm00030mhb, 1havd00779mhb, 60xv00675mhb and 53dv00959mhb. So I would suggest using regex to process them.

In DmmParsingProfile.createSearchString() function, you can add below code fragment before URLCodec codec = new URLCodec();

	Pattern patternID = Pattern.compile("(h?_?[0-9]*[a-z]+[0-9]+)");
	Matcher matcher = patternID.matcher(fileNameNoExtension);
	while (matcher.find()) {
		fileNameNoExtension = matcher.group(1);
	}

In Movie.scrapeMovie() function, when you construct string idFromMovieFileToMatch, you might need to replace the postfix I mentioned above, like sm, dm, mmb, mhb, hmb and hhb.

@redraiderj redraiderj added the bug label Oct 9, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant