"For the rational study of the law the blackletter man may be the man of the present, but the man of the future is the man of statistics." β Oliver Wendell Holmes Jr, "The Path of the Law", Harvard Law Review, 10: 457β478 (1897)
I'm Rens Jansen, a legal engineer at Deloitte Tax & Legal. During my studies I've developed an interest in how legal services can be improved with the help of technology (i.e. legal tech). This interest has led me to align my career and self-development with a multidisciplinary focus on data science in the legal industry. As such, my current focus is on developing pan-European legal research solutions at Moonlit.
As the legal domain is largely text-based, my interests naturally gravitate towards:
- Natural language processing (NLP);
- Information retrieval (IR).
More specifically, I'm interested in:
- Text mining;
- Network analysis;
- Large-scale multi-label text classification (LMTC);
- Domain adaptation of large language models (LLMs);
- Transfer learning in domain-specific areas, such as the legal domain, suffers from a word distribution shift when comparing between general and domain-specific corpora.
- This challenge of lacking performance is further exacerbated by tokenizers which incorrectly tokenize domain-specific terminology (source).
- The aforementioned is especially problematic in low-resource languages.
In connection to my interest I'm currently exploring the following projects/libraries:
- LiDO's LinkeXtractor
- ICLR&D's Blackstone / Addleshaw Goddard's Blackstone Web API
- CiteURL
Let's connect if you share my interests. Most of my contact details can be found below.