One word per line + name change for 3rd-party license files
Put one word per line so that diffs are easier to read when people push minor changes to a stopword file.

The 3rd-party license file is renamed so that GitHub is less likely to be confused about which license the library uses.
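To illustrate why one word per line helps, here is a minimal sketch. The helper `formatWordList` is hypothetical and not part of this repository; it only mimics the layout this commit introduces. Adding a single word then changes exactly one line of the generated source, where the old single-line array would show the whole line as changed.

```javascript
// Hypothetical helper (not part of the stopword package): renders a word
// list as source text with one quoted word per line.
function formatWordList (name, words) {
  // Escape backslashes and single quotes so entries like 'n stay valid.
  const quote = (w) => "'" + w.replace(/\\/g, '\\\\').replace(/'/g, "\\'") + "'"
  const body = words.map(quote).join(',\n')
  return `const ${name} = [\n${body}\n]\nexport { ${name} }\n`
}

const before = formatWordList('afr', ['die', 'het', "'n"])
const after = formatWordList('afr', ['die', 'het', 'en', "'n"])

// Lines present in the new text but not in the old one: a rough stand-in
// for the "+" lines a diff tool would report.
const beforeLines = new Set(before.split('\n'))
const addedLines = after.split('\n').filter((line) => !beforeLines.has(line))

console.log(addedLines)
```

With the previous layout, the same change would rewrite the single long `const afr = [...]` line, so a reviewer would have to scan the whole array to find the new word.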
eklem committed Jan 15, 2023
1 parent 22d134c commit 00044a3
Showing 73 changed files with 54,199 additions and 1,327 deletions.
File renamed without changes.
13,350 changes: 13,284 additions & 66 deletions dist/stopword.cjs.js

Large diffs are not rendered by default.

2 changes: 1 addition & 1 deletion dist/stopword.cjs.min.js

Large diffs are not rendered by default.

2 changes: 1 addition & 1 deletion dist/stopword.esm.min.mjs

Large diffs are not rendered by default.

13,350 changes: 13,284 additions & 66 deletions dist/stopword.esm.mjs

Large diffs are not rendered by default.

15,464 changes: 14,341 additions & 1,123 deletions dist/stopword.umd.js

Large diffs are not rendered by default.

2 changes: 1 addition & 1 deletion dist/stopword.umd.min.js

Large diffs are not rendered by default.

2 changes: 1 addition & 1 deletion package.json
@@ -1,6 +1,6 @@
{
"name": "stopword",
-"version": "2.0.5",
+"version": "2.0.6",
"description": "A module for node.js and the browser that takes in text and returns text that is stripped of stopwords. Has pre-defined stopword lists for 62 languages and also takes lists with custom stopwords as input.",
"main": "./dist/stopword.cjs.js",
"module": "./dist/stopword.esm.js",
2 changes: 1 addition & 1 deletion rollup.config.js
@@ -35,7 +35,7 @@ export default [
banner: {
commentStyle: 'regular', // The default
content: {
-file: path.join(__dirname, './src/license-references.txt')
+file: path.join(__dirname, './src/reference.txt')
}
}
})
2 changes: 1 addition & 1 deletion src/license-references.txt → src/reference.txt
@@ -3,4 +3,4 @@ Short version for minified scripts:
The MIT License (MIT)
Copyright (c) 2015 - 2022 Fergus McDowall

-Full license text + Third party licenses found in ./LICENSES.txt
+Full license text + Third party licenses found in ./3rd-party.txt
62 changes: 58 additions & 4 deletions src/stopwords_afr.js
@@ -1,20 +1,74 @@
/* Copyright 2016 Liam Doherty
-Licensed under the Apache License, Version 2.0 (the "License");
+Licensed under the Apache License,
+Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
-Unless required by applicable law or agreed to in writing, software
+Unless required by applicable law or agreed to in writing,
+software
distributed under the License is distributed on an "AS IS" BASIS,
-WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND,
+either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
*/

/* This list is frequency sorted. That means it can be sliced from the bottom
and be less aggressive in excluding stopwords */

-const afr = ['die', 'het', 'en', 'sy', 'nie', 'was', 'hy', 'te', 'is', 'ek', 'om', 'hulle', 'in', 'my', '\'n', 'vir', 'toe', 'haar', 'van', 'dit', 'op', 'se', 'wat', 'met', 'gaan', 'baie', 'ons', 'jy', 'na', 'maar', 'hom', 'so', 'n', 'huis', 'kan', 'aan', 'dat', 'daar', 'sal', 'jou', 'gesê', 'by', 'kom', 'een', 'ma', 'as', 'son', 'groot', 'begin', 'al']
+const afr = [
+'die',
+'het',
+'en',
+'sy',
+'nie',
+'was',
+'hy',
+'te',
+'is',
+'ek',
+'om',
+'hulle',
+'in',
+'my',
+'\'n',
+'vir',
+'toe',
+'haar',
+'van',
+'dit',
+'op',
+'se',
+'wat',
+'met',
+'gaan',
+'baie',
+'ons',
+'jy',
+'na',
+'maar',
+'hom',
+'so',
+'n',
+'huis',
+'kan',
+'aan',
+'dat',
+'daar',
+'sal',
+'jou',
+'gesê',
+'by',
+'kom',
+'een',
+'ma',
+'as',
+'son',
+'groot',
+'begin',
+'al'
+]
export { afr }
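The comment in this file says the list is frequency sorted and can be sliced from the bottom to exclude fewer words. A minimal sketch of that idea follows. The array holds only the first ten entries of the list above, and `stripWords` is a hypothetical local stand-in for a stopword filter, not the package's own function; the package description suggests a sliced array could be passed in as a custom stopword list, but that is an assumption here.

```javascript
// First ten entries of the frequency-sorted afr list shown in the diff.
const afr = ['die', 'het', 'en', 'sy', 'nie', 'was', 'hy', 'te', 'is', 'ek']

// Hypothetical stand-in for a stopword filter: drops every token that
// appears in the given stopword list (case-insensitive).
function stripWords (tokens, stopwords) {
  const blocked = new Set(stopwords)
  return tokens.filter((token) => !blocked.has(token.toLowerCase()))
}

const tokens = ['hy', 'was', 'nie', 'die', 'man', 'nie']

// Full list: the most aggressive filtering.
const strict = stripWords(tokens, afr)

// Keep only the four most frequent stopwords by cutting entries off the
// bottom of the list; less frequent words now survive.
const lenient = stripWords(tokens, afr.slice(0, 4))

console.log(strict)  // [ 'man' ]
console.log(lenient) // [ 'hy', 'was', 'nie', 'man', 'nie' ]
```

How many entries to keep is a tuning choice that depends on the text being processed; the cutoff of four above is arbitrary.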