Skip to content
Snippets Groups Projects
Select Git revision
  • feature/sso
  • master default protected
  • release-1.10 protected
  • dev protected
  • 549-test-oai-pmh
  • 545-saving-multiple-times-breaks-pid-metadata
  • release-1.9 protected
  • 499-standalone-compute-service-2
  • 539-load-tests
  • hotfix/helm-chart
  • luca_ba_new_interface
  • 534-bug-when-adding-access-to-user-that-is-not-registered-at-dashboard-service
  • release-1.8 protected
  • 533-integrate-semantic-recommendation
  • feature/openshift
  • 518-spark-doesn-t-map-the-headers-correct
  • 485-fixity-checks
  • 530-various-schema-problems-with-subsets
  • release-1.7 protected
  • fix/auth-service
  • fix/pid-list
  • v1.10.0-rc13 protected
  • v1.10.0-rc12 protected
  • v1.10.0-rc11 protected
  • v1.10.0-rc10 protected
  • v1.10.0-rc9 protected
  • v1.10.0-rc8 protected
  • v1.10.0-rc7 protected
  • v1.10.0-rc6 protected
  • v1.10.0-rc5 protected
  • v1.10.0-rc4 protected
  • v1.10.0-rc3 protected
  • v1.10.0-rc2 protected
  • v1.10.0rc1 protected
  • v1.10.0rc0 protected
  • v1.10.0 protected
  • v1.9.3 protected
  • v1.9.2 protected
  • v1.9.2-rc0 protected
  • v1.9.1 protected
  • v1.9.0 protected
41 results

install.sh

Blame
  • guesslang.py 1018 B
    #!/usr/bin/python3
    
    from guess_language import guess_language
    from argparse import ArgumentParser, FileType
    import sys
    
    __authors__ = ["GG"]
    __date__ = 20191031
    __description__ = 'A simple pdf example'
    
    def guesslang(fnm):
    
      fin= open(fnm, 'rt')
      text= fin.read()
      fin.close()
    
      pages= text.split("\f")
    
      # Note: there should not be anything behind the last Form Feed character
      lastpage= pages.pop()
      # print ('--- [lastpage] ----------------------------------------\n', lastpage)
      lastlang= guess_language(lastpage)
    
      page_num= 1
      for page in pages:
        # print ('--- [page ', page_num, '] ----------------------------------------\n', page)
        lang= guess_language(page)
        print(lang, ' ', page_num, ' ', fnm)
        page_num= page_num+1
    
      if (lastlang != 'UNKNOWN'):
        print(lastlang, ' ', page_num+1, ' ', fnm)
    
    argc= len(sys.argv)
    # print(argc)
    if argc > 1:
      for i in range(1,argc):
        guesslang(sys.argv[i])
    else:
      text= sys.stdin.read()
      lang= guess_language(text)
      print(lang, ' ', 'stdin')