Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Scrapper for DMM Seems to be Broken Again using the Latest Code #333

Open
rickytor81 opened this issue Sep 10, 2020 · 7 comments
Open

Scrapper for DMM Seems to be Broken Again using the Latest Code #333

rickytor81 opened this issue Sep 10, 2020 · 7 comments
Labels

Comments

@rickytor81
Copy link

The scrapper for DMM appears to be broken again. I have been using the latest code uploaded by @Wizell, it works well in the past few days. It seems the DMM scrapper broken again today, it is not returning anything even with the manual url.

@rickytor81 rickytor81 added the bug label Sep 10, 2020
@rickytor81 rickytor81 changed the title Scrapper for DMM Seems to be Broken Again using Latest Code Scrapper for DMM Seems to be Broken Again using the Latest Code Sep 10, 2020
@zuko7177
Copy link
Contributor

zuko7177 commented Sep 11, 2020

DMM scraper relies on grammerchecker.net to do translation. That site is not reliable. If that site is not working, DMM scraper will not work properly. See #325 (comment)

@rickytor81
Copy link
Author

DMM scraper relies on grammerchecker.net to do translation. That site is not reliable. If that site is not working, DMM scraper will not work properly. See #325 (comment)

Thank you @zuko7177 for the reply. I am not sure if the issue relates to the translation. I have the option enable to force Japanese scrapping through "Scrape JAV Movies in Japanese Instead Of English". It works fine until yesterday. I tried also using manual url, but still returning nothing. Thanks for looking into it.

@zuko7177
Copy link
Contributor

zuko7177 commented Sep 11, 2020

I have a pull request to improve DMM scraping. #332
In the meantime, if you're familiar with the process you can clone my repo and try it out.

Also, take a look at https://github.com/jvlflame/Javinizer
I found out about it recently. It's great.

@rickytor81
Copy link
Author

I have a pull request to improve DMM scraping. #332
In the meantime, if you're familiar with the process you can clone my repo and try it out.

Also, take a look at https://github.com/jvlflame/Javinizer
I found out about it recently. It's great.

Thank you @zuko7177 ! With your latest repo, it works! The speed is also improved a lot!! I mean A LOT!!!

@koonfoon
Copy link

koonfoon commented Jun 3, 2021

Hi @zuko7177 recently I tried to scrape from DMM using command line, but it gave me error:

Filename =  ../plex/ssni-852/ssni-852.mp4
Parsing with parsing profile = class moviescraper.doctord.controller.siteparsingprofile.specific.DmmParsingProfile
DMM Scraper: Search string --> https://www.dmm.co.jp/search/=/searchstr=ssni-852/
Scraping this webpage for movie: https://www.dmm.co.jp/mono/dvd/-/detail/=/cid=ssni852/?i3_ref=search&i3_ord=2
DMM Scraper: getting JP version at https://www.dmm.co.jp/mono/dvd/-/detail/=/cid=ssni852/?i3_ref=search&i3_ord=2
DMM Scraper: Title --> 華奢な少女の人生初!絶頂ポルチオ開発 巨根×膣中イキオーガズム 槙いずな
DMM Scraper: Plot  --> 槙いずな、人生初のポルチオ開発宣言。奥のさらに奥…ポルチオ徹底開発!!「奥ダメぇぇ!!!子宮が下がってるぅぅぅ…」ズボボッ!!極太バイブ、汗みどろ巨大ペニス喉マンコ拡張イラマ、前代未聞の超ケイレン絶頂3P!おま●こヒクヒク!巨根で抉じ開ける…快感電流ビッキーン!!腹筋ガクガク大痙攣。込み上がる快感オーガズム!極細クビレBODYがイキ乱れ、華奢な 少女が失神するまでケダモノ絶頂!
DMM Scraper: getting actresses from https://actress.dmm.co.jp/-/detail/=/actress_id=1059504/
Exception in thread "main" java.lang.NullPointerException: Cannot invoke "org.jsoup.nodes.Element.attr(String)" because "actressThumbnailElement" is null
        at moviescraper.doctord.controller.siteparsingprofile.specific.DmmParsingProfile.scrapeActors(DmmParsingProfile.java:583)
        at moviescraper.doctord.model.Movie.<init>(Movie.java:139)
        at moviescraper.doctord.model.Movie.scrapeMovie(Movie.java:821)
        at moviescraper.doctord.Main.runScrape(Main.java:215)
        at moviescraper.doctord.Main.main(Main.java:114)

It seem like actress thumbnail is missing? DMM made changes again on their site source code? Please advise. Thank you.

@rickytor81
Copy link
Author

koonfoon, a quick containment is to disable actress scraper, "Scrapper's Settings/DMM/Scrape Actress". It works at least, then you need to manfully add actress info.

Of course, it is only a containment. Let's wait for comments from @zuko7177.

@koonfoon
Copy link

I am able to fix "actressThumbnailElement" is null error. The line of code that causing error:

// Error: unable to select the element
Element actressThumbnailElement = actressPage.select("tr.area-av30.top td img").first();

Changed the css query:

Element actressThumbnailElement = actressPage.select("span.p-section-profile__image img").first();

This is working on my dev/test environment. But I had accidentally update my actual environment that run this scraper to ```JDK 16``. 😭 Now I got this error:

Caused by: java.lang.reflect.InaccessibleObjectException: Unable to make field private final java.util.Comparator java.util.TreeMap.comparator accessible: module java.base does not "opens java.util" to unnamed module @5fb2de77

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants