Authors:
            
                    Ahmad R. Shahid
                    
                        
                    
                     and
                
                    Dimitar Kazakov
                    
                        
                    
                    
                
        
        
            Affiliation:
            
                    
                        
                    
                    The University of York, United Kingdom
                
        
        
        
        
        
             Keyword(s):
            Multilingual Lexicons, Web Crawler,Wikipedia, Natural Language Processing, Web mining, Data mining.
        
        
            
                Related
                    Ontology
                    Subjects/Areas/Topics:
                
                        Applications
                    ; 
                        Artificial Intelligence
                    ; 
                        Computational Intelligence
                    ; 
                        Data Mining
                    ; 
                        Databases and Information Systems Integration
                    ; 
                        Enterprise Information Systems
                    ; 
                        Evolutionary Computing
                    ; 
                        Knowledge Discovery and Information Retrieval
                    ; 
                        Knowledge Engineering and Ontology Development
                    ; 
                        Knowledge-Based Systems
                    ; 
                        Machine Learning
                    ; 
                        Natural Language Processing
                    ; 
                        Pattern Recognition
                    ; 
                        Sensor Networks
                    ; 
                        Signal Processing
                    ; 
                        Soft Computing
                    ; 
                        Symbolic Systems
                    
            
        
        
            
                Abstract: 
                This paper proposes a method for creating a multilingual dictionary by taking the titles of Wikipedia pages in English and then finding the titles of the corresponding articles in other languages. The creation of such multilingual dictionaries has become possible as a result of exponential increase in the size of multilingual information on the web. Wikipedia is a prime example of such multilingual source of information on any conceivable topic in the world, which is edited by the readers. Here, a web crawler has been used to traverse Wikipedia following the links on a given page. The crawler takes out the title along with the titles of the corresponding pages in other targeted languages. The result is a set of words and phrases that are translations of each other. For efficiency, the URLs are organized using hash tables. A lexicon has been constructed which contains 7-tuples corresponding to 7 different languages, namely: English, German, French, Polish, Bulgarian, Greek and Chinese.