Machine Translation Useful Links: Techniques, Toolkits, Videos

2011-12-02 18:39

Tian Liang
2011-12-2

Here are some useful links for the machine translation research field, which you can refer to in your research work or study. Each entry gives a short description of what the toolkit does, together with its URL. For more details, follow the links.



Rule-based Machine Translation Toolkits:


(1) Apertium:

Short Description: A free/open-source rule-based machine translation platform.

URL: http://www.apertium.org/

(2) OpenLogos:

Short Description: An open-source port of the Logos machine translation system for Linux.

URL: http://en.wikipedia.org/wiki/OpenLogos

(3) Matxin:

Short Description: An open-source transfer-based machine translation engine.

URL: http://matxin.sourceforge.net/


Example-based Machine Translation Toolkits:

(1) Marclator:

Short Description: Marclator is a free example-based machine translation system based on the marker hypothesis, comprising a marker-driven chunker, a collection of chunk aligners, and a simple proof-of-concept monotonic recombinator or "decoder".

URL: http://www.openmatrex.org/marclator/
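
To illustrate what a marker-driven chunker does, here is a minimal sketch of chunking under the marker hypothesis. It is not Marclator's code: the marker list is a tiny hypothetical sample, and the rule used (a marker word opens a new chunk only if the current chunk already contains a non-marker word) is an assumption based on the usual description of the technique.

```python
# Hypothetical, tiny marker-word list; real systems use full closed-class
# word lists (determiners, prepositions, conjunctions, pronouns, ...).
MARKERS = {"the", "a", "an", "in", "on", "of", "to", "with", "and"}


def marker_chunk(tokens, markers=MARKERS):
    """Split tokens into chunks at marker words.

    A marker word starts a new chunk, but only when the chunk being built
    already holds at least one non-marker (content) word.
    """
    chunks, current, has_content = [], [], False
    for tok in tokens:
        is_marker = tok.lower() in markers
        if is_marker and has_content:
            chunks.append(current)
            current, has_content = [], False
        current.append(tok)
        if not is_marker:
            has_content = True
    if current:
        chunks.append(current)
    return chunks


if __name__ == "__main__":
    for chunk in marker_chunk("the cat sat on the mat with a hat".split()):
        print(" ".join(chunk))
```

The resulting chunks on both language sides would then be handed to a chunk aligner; that step is not shown here.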

(2) PBMBMT:

Short Description: A Phrase-based Memory-based Machine Translation system, based on memory-based classifiers.

URL: http://ilk.uvt.nl/mbmt/pbmbmt/

(3) OpenMaTrEx:

Short Description: A free/open-source marker-driven example-based machine translation system.

URL: http://openmatrex.org/



Statistical Machine Translation Toolkits:


(1) cdec:

Short Description: Software for decoding and alignment.

URL: http://cdec-decoder.org/

(2) Ncode:

Short Description: An open-source statistical machine translation system based on bilingual n-grams.

URL: http://www.limsi.fr/Individu/jmcrego/bincoder/

(3) DoMY™ CE:

Short Description: DoMY™ CE combines open-source SMT software into one easy-to-install package.

URL: http://www.precisiontranslationtools.com/index.php?option=com_content&view=article&id=1&Itemid=22

(4) Phramer:

Short Description: An open-source statistical phrase-based MT decoder.

URL: http://www.hlt.utdallas.edu/~marian/phramer/

(5) Joshua:

Short Description: Joshua is an open-source statistical machine translation decoder for hierarchical and syntax-based machine translation, written in Java.

URL: http://joshua.sourceforge.net/Joshua/Welcome.html

(6) Moses:

Short Description: The most widely used SMT system, including phrase-based and tree-based models.

URL: http://www.statmt.org/moses/

(7) Thot:

Short Description: A toolkit for training phrase-based models for statistical machine translation. Thot can estimate phrase-based models and obtain the best phrase-level alignments for a given set of sentence pairs.

URL: http://sourceforge.net/projects/thot/


Combined Machine Translation Systems:

(1) Anusaaraka:

Short Description: Anusaaraka is English-Hindi language accessing software. It is a machine translation tool that draws on Panini's Ashtadhyayi (grammar rules) and aims to fuse traditional Indian shastras with advanced modern technologies.

URL: http://anusaaraka.iiit.ac.in/

(2) MANY:

Short Description: An MT system combination tool.

URL: http://code.google.com/p/many/

(3) MEMT:

Short Description: System combination; won 6 of 8 language pairs in WMT11.

URL: http://kheafield.com/code/

(4) Cunei:

Short Description: Cunei is a data-driven platform for machine translation.

URL: http://www.cunei.org/


Alignment Tools:

(1) ABBYY Aligner:

Short Description: ABBYY Aligner is a professional tool for aligning parallel texts and creating Translation Memory databases. It finds matching segments in parallel texts and saves them to TMX files for further use in CAT tools, or to RTF files. According to the vendor, it is based on ABBYY's linguistic technology and offers an intuitive interface with a wide range of functions.

URL: http://www.abbyy.com/aligner/

(2) GIZA++:

Short Description: GIZA++ is a statistical machine translation toolkit that is used to train IBM Models 1-5 and an HMM word alignment model.

URL: http://code.google.com/p/giza-pp/
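
To give a feel for the simplest of the models mentioned above, here is a minimal sketch of EM training for IBM Model 1 lexical translation probabilities t(f|e). It is not GIZA++ code: it omits the NULL word and everything belonging to Models 2-5 and the HMM, and the function name `train_ibm1` and the toy corpus are illustrative assumptions.

```python
from collections import defaultdict


def train_ibm1(pairs, iterations=10):
    """Estimate t(f|e) with EM (IBM Model 1, simplified: no NULL word).

    pairs: list of (e_tokens, f_tokens) sentence pairs.
    Returns a dict-like mapping (f, e) -> probability.
    """
    f_vocab = {f for _, fs in pairs for f in fs}
    t = defaultdict(lambda: 1.0 / len(f_vocab))  # uniform initialisation
    for _ in range(iterations):
        count = defaultdict(float)  # expected count of (f, e) links
        total = defaultdict(float)  # expected count of e being linked
        # E-step: distribute each f over the e words of its sentence pair.
        for es, fs in pairs:
            for f in fs:
                z = sum(t[(f, e)] for e in es)
                for e in es:
                    c = t[(f, e)] / z
                    count[(f, e)] += c
                    total[e] += c
        # M-step: renormalise the expected counts per e word.
        for (f, e), c in count.items():
            t[(f, e)] = c / total[e]
    return t


if __name__ == "__main__":
    corpus = [
        ("the house".split(), "das Haus".split()),
        ("the book".split(), "das Buch".split()),
        ("a book".split(), "ein Buch".split()),
    ]
    t = train_ibm1(corpus)
    for f in ("das", "Haus", "Buch"):
        print(f, "| the ->", round(t[(f, "the")], 3))
```

On this toy corpus the probability mass for "the" concentrates on "das", because "das" is the only word that co-occurs with "the" in both of its sentence pairs.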

(3) Anymalign:

Short Description: Anymalign is a multilingual sub-sentential aligner. It can extract lexical equivalences from sentence-aligned parallel corpora. Its main advantage over other similar tools is that it can align any number of languages simultaneously.

URL: http://www.limsi.fr/Individu/alardill/anymalign/

(4) Hunalign:

Short Description: hunalign aligns bilingual text at the sentence level. Its input is tokenized and sentence-segmented text in two languages. In the simplest case, its output is a sequence of bilingual sentence pairs.

URL: http://mokk.bme.hu/en/resources/hunalign/

(5) Araya:

Short Description: A bilingual alignment tool and alignment editor that creates TMX files.

URL: http://www.heartsome.de/en/araya.php#TMX

(6) MGIZA++:

Short Description: A word alignment tool based on the well-known GIZA++, extended to support multi-threading, resumed training, and incremental training.

URL: http://sourceforge.net/projects/mgizapp/

(7) Berkeley Word Aligner:

Short Description: The Berkeley Word Aligner is a statistical machine translation tool that automatically aligns words in a sentence-aligned parallel corpus, using either supervised or unsupervised training.

URL: http://code.google.com/p/berkeleyaligner/

(8) PostCAT:

Short Description: This package contains code to perform word alignment using IBM Models 1 and 2 and the HMM model. Training uses either standard EM or constrained EM with agreement constraints and substochastic constraints.

URL: http://www.seas.upenn.edu/~strctlrn/CAT/CAT.html

(9) BIA:

Short Description: A suite consisting of a discriminative phrase-based alignment decoder based on linear alignment models, along with training and tuning tools. In the training phase, relative link probabilities are calculated from an initial alignment. The model weights can be tuned directly against MT metrics.

URL: http://code.google.com/p/bia-aligner/

(10) RegAligner:

Short Description: An adequate replacement for GIZA++, since it implements IBM Models 1, 2, 3, and 4 and the HMM model.

URL: https://github.com/Thomas1205/RegAligner

(11) TreeAligner:

Short Description: A statistical tree-to-tree aligner, which can be used for the automatic generation of parallel treebanks.

URL: http://www.ventsislavzhechev.eu/Home/Software/Software.html

(12) Corpus Filtergraph:

Short Description: A statistical machine translation support toolbox to extract, filter, align, and transform text data from multilingual documents into parallel training corpora.

URL: http://sourceforge.net/projects/corpfiltergraph/

(13) tree-alignment-visualizer:

Short Description: Most existing tools were created for visualizing basic word alignments and for manually annotating sentence pairs. At the same time, the research community's growing interest in syntax-augmented machine translation makes the simultaneous visualization of alignment links and parse trees increasingly important. This tool provides that visualization; by giving insight into the data, it facilitates research toward better translation systems.

URL: http://code.google.com/p/tree-alignment-visualizer/


Evaluation Tools:

(1) NIST/BLEU Confidence Estimator:

Short Description: Confidence interval estimation for MT evaluations.

URL: http://projectile.sv.cmu.edu/research/public/tools/bootStrap/tutorial.htm
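
One common way to estimate such confidence intervals is bootstrap resampling over test sentences. The sketch below is not the linked tool's implementation. It assumes, for simplicity, that the metric is an average of per-sentence scores; for a corpus-level metric such as BLEU one would resample sentences and recompute the corpus statistics on each resample instead. The function name and parameters are illustrative.

```python
import random


def bootstrap_ci(sentence_scores, n_resamples=1000, alpha=0.05, seed=0):
    """Percentile bootstrap confidence interval for the mean sentence score.

    Draws n_resamples test sets of the original size (with replacement),
    computes the mean score of each, and returns the alpha/2 and
    1 - alpha/2 percentiles of those means.
    """
    rng = random.Random(seed)
    n = len(sentence_scores)
    means = []
    for _ in range(n_resamples):
        sample = [sentence_scores[rng.randrange(n)] for _ in range(n)]
        means.append(sum(sample) / n)
    means.sort()
    lower = means[int((alpha / 2) * n_resamples)]
    upper = means[int((1 - alpha / 2) * n_resamples) - 1]
    return lower, upper


if __name__ == "__main__":
    scores = [0.2, 0.4, 0.6, 0.8] * 5  # made-up per-sentence scores
    print(bootstrap_ci(scores))
```

A fixed seed keeps the interval reproducible; two systems whose intervals overlap heavily should not be declared different on the basis of this test set alone.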

(2) EvalTrans:

Short Description: A tool for the automatic and manual evaluation of translations.

URL: http://www-i6.informatik.rwth-aachen.de/web/Software/EvalTrans/index.html

(3) ROUGE:

Short Description: Recall-Oriented Understudy for Gisting Evaluation (ROUGE) is a set of metrics and a software package for evaluating automatic summarization and machine translation software in natural language processing. The metrics compare an automatically produced summary or translation against one or more human-produced reference summaries or translations.

URL: http://berouge.com/default.aspx
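
As an illustration of the comparison described above, here is a minimal sketch of ROUGE-N recall against a single reference. It is not the ROUGE package itself: it assumes pre-tokenized input and leaves out stemming, stopword handling, multiple references, and the other ROUGE variants.

```python
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams of a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n_recall(candidate, reference, n=1):
    """Fraction of reference n-grams that also occur in the candidate.

    Matches are clipped: an n-gram is credited at most as many times as
    it appears in the candidate.
    """
    ref = ngrams(reference, n)
    cand = ngrams(candidate, n)
    overlap = sum(min(count, cand[gram]) for gram, count in ref.items())
    total = sum(ref.values())
    return overlap / total if total else 0.0


if __name__ == "__main__":
    cand = "the cat sat on the mat".split()
    ref = "the cat is on the mat".split()
    print(rouge_n_recall(cand, ref, n=1), rouge_n_recall(cand, ref, n=2))
```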

(4) Hjerson:

Short Description: A tool for automatic error classification based on Levenshtein distance, precision, and recall.

URL: http://www.dfki.de/~mapo02/hjerson/

(5) SymEval:

Short Description: SymEval is a translation evaluation toolkit that lets you compare and score translations. All you need is the source text and the translated texts that you would like to evaluate.

URL: http://sourceforge.net/apps/mediawiki/symeval/index.php?title=Main_Page

(6) METEOR:

Short Description: An automatic metric and toolkit for MT evaluation.

URL: http://www.cs.cmu.edu/~alavie/METEOR/

(7) TERCOM:

Short Description: TERCOM is an implementation of the Translation Error Rate (TER), an error metric for machine translation that measures the number of edits required to change a system output into one of the references.

URL: http://nlp.cs.qc.cuny.edu/snover/
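
The edit-counting idea behind TER can be sketched as follows. This is a deliberate simplification and not TERCOM's algorithm: real TER also allows block shifts of word sequences as single edits, which this sketch omits, so it behaves like a word error rate. The normalization by average reference length follows the usual definition of TER; the function names are illustrative.

```python
def edit_distance(hyp, ref):
    """Word-level Levenshtein distance (insertions, deletions, substitutions)."""
    prev = list(range(len(ref) + 1))
    for i, h in enumerate(hyp, 1):
        cur = [i]
        for j, r in enumerate(ref, 1):
            cur.append(min(prev[j] + 1,              # delete h
                           cur[j - 1] + 1,           # insert r
                           prev[j - 1] + (h != r)))  # match or substitute
        prev = cur
    return prev[-1]


def ter_no_shifts(hyp, refs):
    """Edits to the closest reference, divided by the average reference length.

    Assumes refs is a non-empty list of non-empty token lists.
    """
    edits = min(edit_distance(hyp, ref) for ref in refs)
    avg_ref_len = sum(len(ref) for ref in refs) / len(refs)
    return edits / avg_ref_len


if __name__ == "__main__":
    hyp = "the cat sat on mat".split()
    refs = ["the cat sat on the mat".split()]
    print(ter_no_shifts(hyp, refs))
```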

(8) TERcpp:

Short Description: This tool scores machine translation performance with the TER metric. The code is based on Snover's algorithm.

URL: http://sourceforge.net/projects/tercpp/

(9) Mteval:

Short Description: An implementation of the BLEU and NIST MT evaluation metrics.

URL: http://jaguar.ncsl.nist.gov/mt/resources/mteval-v13a-20091001.tar.gz
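
For readers new to BLEU, here is a minimal sketch of corpus-level BLEU with one reference per segment. It is not the mteval script: it assumes pre-tokenized input (mteval applies its own tokenization), uses no smoothing, and does not cover the NIST metric or multiple references.

```python
import math
from collections import Counter


def ngram_counts(tokens, n):
    """Count the n-grams of a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def corpus_bleu(hyps, refs, max_n=4):
    """Corpus BLEU: geometric mean of clipped n-gram precisions times a brevity penalty.

    hyps, refs: parallel lists of token lists (one reference per hypothesis).
    """
    matches, totals = [0] * max_n, [0] * max_n
    hyp_len = ref_len = 0
    for hyp, ref in zip(hyps, refs):
        hyp_len += len(hyp)
        ref_len += len(ref)
        for n in range(1, max_n + 1):
            h, r = ngram_counts(hyp, n), ngram_counts(ref, n)
            # Clip each hypothesis n-gram count by its count in the reference.
            matches[n - 1] += sum(min(c, r[g]) for g, c in h.items())
            totals[n - 1] += sum(h.values())
    if min(matches) == 0:
        return 0.0  # unsmoothed: any zero precision gives BLEU 0
    log_precision = sum(math.log(m / t) for m, t in zip(matches, totals)) / max_n
    brevity = 1.0 if hyp_len > ref_len else math.exp(1 - ref_len / hyp_len)
    return brevity * math.exp(log_precision)


if __name__ == "__main__":
    hyp = "the cat sat on the mat today".split()
    ref = "the cat sat on the mat".split()
    print(round(corpus_bleu([hyp], [ref]), 4))
```

Because matches are accumulated over the whole corpus before taking precisions, this is not the same as averaging sentence-level BLEU scores.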

(10) MultEval:

Short Description: A machine translation evaluation toolkit (BLEU, METEOR, TER).

URL: https://github.com/jhclark/multeval

(11) NIST:

Short Description: The NIST Open Machine Translation (OpenMT) evaluation.

URL: http://www.itl.nist.gov/iad/mig//tests/mt/


Language Models:

(1) IRSTLM:

Short Description: A collection of algorithms and data structures for estimating, storing, and accessing very large LMs.

URL: http://hlt.fbk.eu/en/irstlm

(2) KenLM:

Short Description: KenLM is a library that loads language model files and returns probabilities.

URL: http://kheafield.com/code/kenlm/
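
To show what "loading a language model file and returning probabilities" involves, here is a minimal pure-Python sketch that parses a small ARPA-format backoff model and queries it. It does not use KenLM or its API: `parse_arpa`, `log10_prob`, and the sample model are illustrative assumptions, and the sketch assumes the model contains an `<unk>` entry.

```python
SAMPLE_ARPA = r"""
\data\
ngram 1=4
ngram 2=2

\1-grams:
-1.0 <unk>
-0.5 the -0.3
-0.7 cat -0.2
-0.9 sat

\2-grams:
-0.2 the cat
-0.4 cat sat

\end\
"""


def parse_arpa(text):
    """Return {ngram_tuple: (log10_prob, log10_backoff)} from ARPA-format text."""
    table, order = {}, 0
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith("\\"):
            # Section headers look like "\2-grams:"; "\data\" and "\end\" reset.
            order = int(line[1:line.index("-")]) if line.endswith("-grams:") else 0
            continue
        if order == 0:
            continue  # skips the "ngram N=count" lines
        fields = line.split()
        words = tuple(fields[1:1 + order])
        backoff = float(fields[1 + order]) if len(fields) > 1 + order else 0.0
        table[words] = (float(fields[0]), backoff)
    return table


def log10_prob(table, word, context=()):
    """log10 P(word | context) with standard backoff to shorter contexts."""
    context = tuple(context)
    ngram = context + (word,)
    if ngram in table:
        return table[ngram][0]
    if not context:
        return table[("<unk>",)][0]  # assumption: the model defines <unk>
    backoff = table.get(context, (0.0, 0.0))[1]
    return backoff + log10_prob(table, word, context[1:])


if __name__ == "__main__":
    lm = parse_arpa(SAMPLE_ARPA)
    print(log10_prob(lm, "cat", ("the",)))  # seen bigram
    print(log10_prob(lm, "sat", ("the",)))  # backs off to the unigram
```

Production libraries store the same information in compact tries or hash tables rather than a Python dictionary, which is where most of their speed and memory savings come from.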

(3) RandLM:

Short Description: This project deals with space-efficient n-gram-based language models built using randomized representations.

URL: http://randlm.sourceforge.net/
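
A Bloom filter is one well-known randomized representation for storing n-grams compactly. The sketch below illustrates the general idea only and makes no claim about RandLM's actual data structures; the class name, sizes, and hashing scheme are assumptions. The trade-off is that lookups can return false positives but never false negatives.

```python
import hashlib


class NgramBloomFilter:
    """Set membership for n-grams in a fixed number of bits."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes  # must be below 256 for the seeding used here
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, ngram):
        # Derive num_hashes bit positions by salting one hash function.
        key = " ".join(ngram).encode("utf-8")
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(bytes([seed]) + key).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, ngram):
        for pos in self._positions(ngram):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, ngram):
        return all(self.bits[pos // 8] & (1 << (pos % 8))
                   for pos in self._positions(ngram))


if __name__ == "__main__":
    bf = NgramBloomFilter()
    bf.add(("the", "cat"))
    print(("the", "cat") in bf)  # always True once added
```

Storing counts or probabilities, rather than plain membership, needs an extra encoding layer on top of this, which is not shown.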

(4) SRILM:

Short Description: A toolkit for building and applying statistical language models.

URL: http://www.speech.sri.com/projects/srilm/



Part-of-Speech Taggers:


(1) MXPOST:

Short Description: MXPOST was developed by Adwait Ratnaparkhi as part of his PhD thesis. It is a Java implementation of a maximum entropy model. It can be trained for any language for which annotated POS data exists.

URL: ftp://ftp.cis.upenn.edu/pub/adwait/jmx/jmx.tar.gz


(2) TreeTagger:

Short Description: TreeTagger is a tool for annotating text with part-of-speech and lemma information.

URL: http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/


Syntactic Parsers:

(1) Berkeley Parser:

Short Description: The parser focuses on learning probabilistic context-free grammars (PCFGs), which assign the most likely parse tree to a sequence of words. It supports a variety of languages and achieves state-of-the-art performance on most of them. The languages are English, Bulgarian, Arabic, Chinese, French, and German.

URL: http://code.google.com/p/berkeleyparser/
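
Finding the most likely parse tree under a PCFG can be sketched with the Viterbi variant of the CKY algorithm. This is a textbook illustration, not the Berkeley Parser's method (which learns refined grammars and uses more elaborate inference). It assumes a grammar in Chomsky normal form; the toy grammar and the function name are hypothetical.

```python
import math


def cky_best_parse(tokens, lexical, binary, start="S"):
    """Return (tree, probability) of the best parse, or None if there is none.

    lexical: {(A, word): prob} for rules A -> word
    binary:  {(A, B, C): prob} for rules A -> B C
    """
    n = len(tokens)
    chart = {}  # chart[(i, j)][A] = (log_prob, backpointer)
    for i, word in enumerate(tokens):
        chart[(i, i + 1)] = {
            a: (math.log(p), word)
            for (a, w), p in lexical.items() if w == word
        }
    for span in range(2, n + 1):
        for i in range(n - span + 1):
            j = i + span
            cell = {}
            for k in range(i + 1, j):  # split point
                left, right = chart[(i, k)], chart[(k, j)]
                for (a, b, c), p in binary.items():
                    if b in left and c in right:
                        score = math.log(p) + left[b][0] + right[c][0]
                        if a not in cell or score > cell[a][0]:
                            cell[a] = (score, (k, b, c))
            chart[(i, j)] = cell

    def build(i, j, a):
        back = chart[(i, j)][a][1]
        if isinstance(back, str):  # lexical rule: backpointer is the word
            return (a, back)
        k, b, c = back
        return (a, build(i, k, b), build(k, j, c))

    if start not in chart.get((0, n), {}):
        return None
    return build(0, n, start), math.exp(chart[(0, n)][start][0])


if __name__ == "__main__":
    lexical = {("NP", "she"): 0.5, ("NP", "fish"): 0.5, ("V", "eats"): 1.0}
    binary = {("S", "NP", "VP"): 1.0, ("VP", "V", "NP"): 1.0}
    print(cky_best_parse("she eats fish".split(), lexical, binary))
```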

(2) BitPar:

Short Description: BitPar is a parser for highly ambiguous probabilistic context-free grammars (such as treebank grammars). BitPar uses bit-vector operations to speed up the basic parsing operations by parallelization.

URL: http://www.ims.uni-stuttgart.de/tcl/SOFTWARE/BitPar.html

(3) Collins:

Short Description: A statistical parser developed as part of Michael Collins's PhD thesis. It also requires MXPOST to be installed.

URL: http://www.cs.columbia.edu/~mcollins/

(4) GenPar:

Short Description: It provides an architecture, a design, and an implementation of an integrated system for statistical machine translation by parsing.

URL: http://nlp.cs.nyu.edu/GenPar/

(5) LoPar:

Short Description: LoPar is an implementation of a parser for head-lexicalized probabilistic context-free grammars, which can also be used for morphological analysis.

URL: http://www.ims.uni-stuttgart.de/tcl/SOFTWARE/LoPar.html


Study Videos:

(1) Phrase-based and factored statistical machine translation videos:

Short Description: A lecture given by Philipp Koehn.

URL: http://videolectures.net/aerfaiss08_koehn_pbfs/

(2) Video and Lectures (Complete Video Tutorials):

Short Description: A collection of videos on programming languages.

URL: http://www.spjc8.com/

(3) Boobooke (播布客):

Short Description: This site hosts many language study lectures.

URL: http://www.boobooke.com/index.html