Class LanguageProfile


  • @Deprecated
    public class LanguageProfile
    extends Object
    Deprecated.
    Language profile based on ngram counts.
    Since:
    Apache Tika 0.5
    • Field Detail

      • DEFAULT_NGRAM_LENGTH

        public static final int DEFAULT_NGRAM_LENGTH
        Deprecated.
        See Also:
        Constant Field Values
      • useInterleaved

        public static boolean useInterleaved
        Deprecated.
    • Constructor Detail

      • LanguageProfile

        public LanguageProfile(int length)
        Deprecated.
      • LanguageProfile

        public LanguageProfile()
        Deprecated.
      • LanguageProfile

        public LanguageProfile(String content,
                               int length)
        Deprecated.
      • LanguageProfile

        public LanguageProfile(String content)
        Deprecated.
    • Method Detail

      • getCount

        public long getCount()
        Deprecated.
      • getCount

        public long getCount(String ngram)
        Deprecated.
      • add

        public void add(String ngram)
        Deprecated.
        Adds a single occurrence of the given ngram to this profile.
        Parameters:
        ngram - the ngram
      • add

        public void add(String ngram,
                        long count)
        Deprecated.
        Adds multiple occurrences of the given ngram to this profile.
        Parameters:
        ngram - the ngram
        count - number of occurrences to add
      • distance

        public double distance(LanguageProfile that)
        Deprecated.
        Calculates the geometric distance between this profile and the given language profile.
        Parameters:
        that - the other language profile
        Returns:
        distance between the profiles
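
The add, getCount and distance methods above can be illustrated with a small, self-contained sketch. It does not use Apache Tika. The class NgramProfileSketch is a hypothetical stand-in written for this example, and its distance method assumes that "geometric distance" means the Euclidean distance between the two profiles' relative ngram frequencies. The page above does not confirm that reading, so treat the formula as one plausible interpretation, not as the library's implementation.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

// Hypothetical stand-in for illustration only; this is NOT Tika's LanguageProfile.
class NgramProfileSketch {
    private final Map<String, Long> counts = new HashMap<>();
    private long total = 0;

    // Adds a single occurrence of the given ngram.
    public void add(String ngram) {
        add(ngram, 1);
    }

    // Adds multiple occurrences of the given ngram.
    public void add(String ngram, long count) {
        counts.merge(ngram, count, Long::sum);
        total += count;
    }

    // Total number of ngram occurrences recorded so far.
    public long getCount() {
        return total;
    }

    // Number of occurrences recorded for one ngram (0 if never added).
    public long getCount(String ngram) {
        return counts.getOrDefault(ngram, 0L);
    }

    // ASSUMPTION: Euclidean distance between relative-frequency vectors.
    public double distance(NgramProfileSketch that) {
        // Guard against division by zero for empty profiles.
        double thisTotal = Math.max(this.total, 1.0);
        double thatTotal = Math.max(that.total, 1.0);

        // Compare over every ngram seen in either profile.
        Set<String> all = new HashSet<>(this.counts.keySet());
        all.addAll(that.counts.keySet());

        double sumOfSquares = 0.0;
        for (String ngram : all) {
            double diff = this.getCount(ngram) / thisTotal
                        - that.getCount(ngram) / thatTotal;
            sumOfSquares += diff * diff;
        }
        return Math.sqrt(sumOfSquares);
    }

    public static void main(String[] args) {
        NgramProfileSketch a = new NgramProfileSketch();
        a.add("the");
        a.add("and", 3);

        NgramProfileSketch b = new NgramProfileSketch();
        b.add("the", 4);

        System.out.println("total in a: " + a.getCount());
        System.out.println("count of 'and' in a: " + a.getCount("and"));
        System.out.println("distance(a, b): " + a.distance(b));
    }
}
```

Under this reading, a profile is at distance 0 from itself, and the distance grows as the relative ngram frequencies diverge. Normalizing by the total count keeps profiles built from texts of different lengths comparable.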