-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
110 changed files
with
19,605 additions
and
1 deletion.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,29 @@ | ||
Copyright (c) 2010 TakeLab | ||
|
||
Text Analysis and Knowledge Engineering Lab (http://takelab.fer.hr) | ||
Faculty of Electrical Engineering and Computing | ||
University of Zagreb | ||
|
||
Coral License | ||
|
||
Permission is hereby granted, free of charge, to any person obtaining | ||
a copy of this software and associated documentation files (the | ||
"Software"), to use the Software without restriction, including | ||
without limitation the rights to use and copy the Software. | ||
|
||
Unless permission is issued by TakeLab, this licence does not allow any | ||
person obtaining a copy of the Software to modify it in any way, merge, | ||
distribute, publish, sublicense and/or sell copies of the Software, | ||
and to permit persons to whom the Software is furnished to do so, | ||
subject to the following conditions: | ||
|
||
The above copyright notice and this permission notice shall be | ||
included in all copies or substantial portions of the Software. | ||
|
||
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, | ||
EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF | ||
MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND | ||
NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE | ||
LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION | ||
OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION | ||
WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,4 @@ | ||
Manifest-Version: 1.0 | ||
Sealed: true | ||
Main-Class: hr.fer.zemris.ktlab.sap.gui.Main | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,4 +1,27 @@ | ||
Coral | ||
===== | ||
|
||
Repozitorij za Coral | ||
This repository contains the complete code base for Coral, a parallel text | ||
alignment tool. | ||
|
||
Coral is a desktop application developed in Java, used to either manually or | ||
automatically align a parallel text, i.e. to find correspondences of segments | ||
in two translations of a text. Usually, one text is the original, and the other | ||
a translation to another language. | ||
|
||
Manual alignment is supported with an easy-to-use graphical interface, allowing | ||
the user to use drag-and-drop actions to create N to N connections between | ||
segments in both texts. Texts can also be aligned automatically using the | ||
language-agnostic Gale & Church alignment algorithm. | ||
Aligned texts can be exported to the XML-based TMX format. | ||
|
||
Coral was developed by a student team as a practical project for a course at | ||
the University of Zagreb, Croatia, but potential contributors are welcome to | ||
use the code and help expand Coral by implementing new features. | ||
|
||
To start contributing, please contact TakeLab through Github, or at: | ||
[email protected] | ||
|
||
For more information, please see: | ||
http://takelab.fer.hr | ||
http://takelab.fer.hr/coral/ |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,35 @@ | ||
prof | ||
dr | ||
mr | ||
sc | ||
v | ||
r | ||
d | ||
npr | ||
itd | ||
al | ||
gdin | ||
gđa | ||
gđica | ||
tj | ||
eg | ||
g | ||
gosp | ||
ing | ||
inž | ||
sv | ||
st | ||
ms | ||
mrs | ||
miss | ||
mons | ||
msgr | ||
fra | ||
o | ||
gl | ||
ur | ||
arh | ||
dir | ||
tzv | ||
usp | ||
sl |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,81 @@ | ||
a. | ||
abbr. | ||
acad. | ||
al. | ||
alt. | ||
assn. | ||
at. | ||
at. | ||
aug. | ||
ave. | ||
b. | ||
c. | ||
capt. | ||
cent. | ||
co. | ||
col. | ||
comdr. | ||
corp. | ||
cpl. | ||
d. | ||
dec. | ||
dept. | ||
dist. | ||
div. | ||
dr. | ||
e. | ||
ed. | ||
est. | ||
feb. | ||
fl. | ||
gal. | ||
gen. | ||
gov. | ||
grad. | ||
hon. | ||
i. | ||
in. | ||
inc. | ||
inst. | ||
jan. | ||
jr. | ||
lat. | ||
lib. | ||
long. | ||
lt. | ||
ltd. | ||
m. | ||
mr. | ||
mrs. | ||
ms. | ||
mt. | ||
mts. | ||
mus. | ||
n. | ||
no. | ||
nov. | ||
oct. | ||
op. | ||
p. | ||
pl. | ||
pop. | ||
prof. | ||
pseud. | ||
pt. | ||
pub. | ||
r. | ||
rep. | ||
rev. | ||
s. | ||
sen. | ||
sept. | ||
ser. | ||
sgt. | ||
sr. | ||
st. | ||
u. | ||
uninc. | ||
univ. | ||
vol. | ||
vs. | ||
wt. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,72 @@ | ||
prof | ||
dr | ||
mr | ||
sc | ||
v | ||
r | ||
d | ||
npr | ||
itd | ||
al | ||
gosp | ||
gdin | ||
gđa | ||
gđica | ||
tj | ||
eg | ||
g | ||
gosp | ||
ing | ||
inž | ||
sv | ||
st | ||
ms | ||
mrs | ||
miss | ||
mons | ||
msgr | ||
fra | ||
o | ||
gl | ||
ur | ||
arh | ||
dir | ||
tzv | ||
usp | ||
sl | ||
Prof | ||
Dr | ||
Mr | ||
Sc | ||
V | ||
R | ||
D | ||
Npr | ||
Itd | ||
Al | ||
Gosp | ||
Gdin | ||
Gđa | ||
Gđica | ||
Tj | ||
Eg | ||
G | ||
Gosp | ||
Ing | ||
Inž | ||
Sv | ||
St | ||
Ms | ||
Mrs | ||
Miss | ||
Mons | ||
Msgr | ||
Fra | ||
O | ||
Gl | ||
Ur | ||
Arh | ||
Dir | ||
Tzv | ||
Usp | ||
Sl |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,55 @@ | ||
<project name="SAP" basedir="." default="run"> | ||
<property name="src.dir" value="src"/> | ||
<property name="build.dir" value="build"/> | ||
<property name="manifest.file" value="./META-INF/MANIFEST.MF"/> | ||
<property name="jar.file" value="SAP.jar"/> | ||
|
||
<path id="src.class.path"> | ||
<pathelement path="${build.dir}"/> | ||
</path> | ||
|
||
<target name="init"> | ||
<available file="config.xml" property="config.xml.present"/> | ||
<fail unless="config.xml.present"> | ||
"config.xml" file is not present. | ||
</fail> | ||
|
||
<available file="${manifest.file}" property="manifest.mf.present"/> | ||
<fail unless="manifest.mf.present"> | ||
"${manifest.file}" file is not present. | ||
</fail> | ||
|
||
<available file="icons" type="dir" property="icons.present"/> | ||
<fail unless="icons.present"> | ||
"icons" directory is not present. | ||
</fail> | ||
|
||
<mkdir dir="${build.dir}"/> | ||
<tstamp/> | ||
</target> | ||
|
||
<target name="compile" depends="init"> | ||
<javac destdir="${build.dir}" includeAntRuntime="false" debug="true" debuglevel="lines,vars,source" encoding="UTF-8"> | ||
<src path="${src.dir}"/> | ||
<classpath refid="src.class.path"/> | ||
</javac> | ||
</target> | ||
|
||
<target name="package" depends="compile"> | ||
<jar destfile="${jar.file}" manifest="${manifest.file}"> | ||
<fileset dir="${build.dir}" includes="**/*.class"/> | ||
</jar> | ||
</target> | ||
|
||
<target name="run" depends="package"> | ||
<java jar="${jar.file}" failOnError="true" fork="true"/> | ||
</target> | ||
|
||
<target name="clean"> | ||
<delete includeEmptyDirs="true"> | ||
<fileset dir="${build.dir}"/> | ||
</delete> | ||
<delete file="${jar.file}"/> | ||
</target> | ||
|
||
</project> |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,19 @@ | ||
<?xml version="1.0" encoding="UTF-8" standalone="no"?> | ||
<!DOCTYPE properties SYSTEM "http://java.sun.com/dtd/properties.dtd"> | ||
<properties> | ||
<comment>config.xml</comment> | ||
<entry key="sboardShowMenu">false</entry> | ||
<entry key="fontColor2">0x000000</entry> | ||
<entry key="selectedFontColor">0xffffff</entry> | ||
<entry key="fontColor1">0x000000</entry> | ||
<entry key="lastOpenDir">C:\Documents and Settings\Igor\Desktop\bla.razor</entry> | ||
<entry key="scrollType">together</entry> | ||
<entry key="unsafeSplitColor">0xe4b452</entry> | ||
<entry key="listSelectedColor">0x000099</entry> | ||
<entry key="sboardScrollAmount">10</entry> | ||
<entry key="listColor2">0xccffff</entry> | ||
<entry key="fontSize">11</entry> | ||
<entry key="listColor1">0xffffff</entry> | ||
<entry key="markOKOnSelect">false</entry> | ||
<entry key="sboardTreshold">5</entry> | ||
</properties> |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,18 @@ | ||
<html> | ||
<body> | ||
<font face="Arial"> | ||
Fakultet elektrotehnike i raèunarstva Sveuèilišta u Zagrebu<br /> | ||
Zavod za elektroniku, mikroelektroniku, raèunalne i inteligentne sustave<br /> | ||
KTLab<br /> | ||
Unska 3<br /> | ||
HR-10 000 Zagreb<br /> | ||
tel. +385 1 6129 935<br /> | ||
fax. +385 1 6129 653<br /><br /><br /> | ||
|
||
|
||
Program za sravnjivanje paralelnih korpusa nastao je na Fakultetu elektrotehnike i raèunarstva Sveuèilišta u Zagrebu u akademskoj godini 2007/08 u okviru kolegija PROJEKT.<br /><br /> | ||
|
||
Program su izradili studenti Marin Akšamoviæ, Renato Dragišiæ, Ðive Jakobušiæ, Marin Japec, Vjekoslav Osmann, Željko Rumenjak, Ivan Šolta i Igor Šoš (voditelj studentskog tima) pod vodstvom prof. dr.sc. Bojane Dalbelo Bašiæ i mr.sc. Jana Šnajdera uz struènu pomoæ prof.dr.sc. Marka Tadiæa i doc.dr.sc. Sanje Seljan s Filozofskog fakulteta Sveuèilišta u Zagrebu.<br /> | ||
</font> | ||
</body> | ||
</html> |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,8 @@ | ||
<html> | ||
<body> | ||
<font face="Arial"> | ||
<p><b>Coral</b> (CORpus ALigner) je alat koji korisniku omogućuje olakšano sravnjivanje paralelnih korpusa, uz mogućnost ručnog ispravljanja krivo segmentiranih ili krivo uparenih rečenica.</p> | ||
<p>U program se učitava tekst na dva različita jezika. Prvotno se pomoću algoritma za segmentaciju cijeli tekst podijeli na rečenice. Prilikom dijeljenja teksta može doći do grešaka zbog posebnosti u svakom jeziku. Ako neke rečenice nisu ispravno podijeljene omogućeno je jednostavno ispravljanje greške. Pomoću algoritma za sravnavanje rečenice na jednom jeziku se povezuju sa prijevodima na drugom.</p> | ||
<p>Zbog nesavršenosti algoritma moguće su pogreške pri uparivanju, koje se mogu ručno ispraviti. Nakon što su tekstovi obrađeni, te je postignuta određena razina točnosti, rezultat rada se pohranjuje u obliku prijevodne memorije. Unutar posebnih oznaka (eng. tag) nalaze se uparene rečenice. Svaki takav blok (prijevodna jedinica) dobiva svoj identifikator koji je jedinstveno označava.</p></font> | ||
</body> | ||
</html> |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,15 @@ | ||
<html> | ||
<body> | ||
<font face="Arial"> | ||
<center> | ||
<font size=6><b>CORAL</b></font><br /> | ||
<font size=2>© 2008 FER</font><br /><br /> | ||
|
||
|
||
<b>Program za sravnjivanje paralelnih korpusa</b><br /><br /> | ||
</center> | ||
<br /> | ||
|
||
</font> | ||
</body> | ||
</html> |
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Oops, something went wrong.
Oops, something went wrong.
Oops, something went wrong.
Oops, something went wrong.
Oops, something went wrong.
Oops, something went wrong.
Oops, something went wrong.
Oops, something went wrong.
Oops, something went wrong.
Oops, something went wrong.
Oops, something went wrong.
Oops, something went wrong.