Comments (2)

fabiolimace commented on July 17, 2024

Below is a partial list of the bigrams obtained so far. The corpus used contains 5 languages: en, es, fr, it, pt.

c1 c2 m1 m2 df idf
a a 0 0 1 1.6094379124341003
ã ã 0 0 1 1.6094379124341003
ã ã 0 1 1 1.6094379124341003
á b 0 0 2 0.9162907318741551
a b 0 0 5 0.0
a b 0 1 2 0.9162907318741551
a b 1 0 5 0.0
a ç 0 0 2 0.9162907318741551
á c 0 0 2 0.9162907318741551
â c 0 0 2 0.9162907318741551
a c 0 0 5 0.0
a c 0 1 4 0.22314355131420976
a ç 1 0 1 1.6094379124341003
a c 1 0 5 0.0
á d 0 0 1 1.6094379124341003
a d 0 0 5 0.0
a d 0 1 2 0.9162907318741551
a d 1 0 5 0.0
a d 1 1 1 1.6094379124341003
á e 0 0 1 1.6094379124341003
ã e 0 0 1 1.6094379124341003
a é 0 0 2 0.9162907318741551
a e 0 0 4 0.22314355131420976
a é 0 1 1 1.6094379124341003
ã e 0 1 1 1.6094379124341003
a e 0 1 2 0.9162907318741551
a e 1 0 1 1.6094379124341003
á f 0 0 1 1.6094379124341003
a f 0 0 5 0.0
a f 0 1 1 1.6094379124341003
a f 1 0 5 0.0
á g 0 0 1 1.6094379124341003
â g 0 0 1 1.6094379124341003
a g 0 0 5 0.0
a g 0 1 1 1.6094379124341003
á g 1 0 1 1.6094379124341003
â g 1 0 1 1.6094379124341003
a g 1 0 5 0.0
a h 0 0 5 0.0
a h 0 1 4 0.22314355131420976
a h 1 0 5 0.0
a h 1 1 2 0.9162907318741551
a í 0 0 1 1.6094379124341003
a î 0 0 1 1.6094379124341003
a ï 0 0 1 1.6094379124341003
á i 0 0 1 1.6094379124341003
a i 0 0 5 0.0
à i 0 1 1 1.6094379124341003
a i 0 1 3 0.5108256237659907
a i 1 0 5 0.0
a i 1 1 3 0.5108256237659907
á j 0 0 1 1.6094379124341003
a j 0 0 5 0.0
a j 1 0 5 0.0
a k 0 0 3 0.5108256237659907
a k 0 1 2 0.9162907318741551
á l 0 0 1 1.6094379124341003
â l 0 0 1 1.6094379124341003
a l 0 0 5 0.0
a l 0 1 5 0.0
a l 1 0 5 0.0
a l 1 1 2 0.9162907318741551
á m 0 0 1 1.6094379124341003
â m 0 0 1 1.6094379124341003
a m 0 0 5 0.0
a m 0 1 3 0.5108256237659907
á m 1 0 1 1.6094379124341003
â m 1 0 1 1.6094379124341003
a m 1 0 5 0.0
a m 1 1 2 0.9162907318741551
a ñ 0 0 1 1.6094379124341003
á n 0 0 1 1.6094379124341003
á ñ 0 0 1 1.6094379124341003
à n 0 0 1 1.6094379124341003
ã n 0 0 1 1.6094379124341003
a n 0 0 5 0.0
á n 0 1 1 1.6094379124341003
a n 0 1 5 0.0
a ñ 1 0 1 1.6094379124341003
á n 1 0 1 1.6094379124341003
a n 1 0 5 0.0
a n 1 1 3 0.5108256237659907
á nothing 1 1 1 1.6094379124341003
à nothing 1 1 3 0.5108256237659907
a nothing 1 1 5 0.0
á o 0 0 1 1.6094379124341003
â o 0 0 1 1.6094379124341003
ã o 0 0 1 1.6094379124341003
a o 0 0 4 0.22314355131420976
á o 0 1 1 1.6094379124341003
ã o 0 1 1 1.6094379124341003
a o 0 1 2 0.9162907318741551
a o 1 0 1 1.6094379124341003
a o 1 1 1 1.6094379124341003
á p 0 0 2 0.9162907318741551
a p 0 0 5 0.0
a p 0 1 1 1.6094379124341003
a p 1 0 5 0.0
á q 0 0 1 1.6094379124341003
a q 0 0 4 0.22314355131420976
á q 1 0 1 1.6094379124341003
a q 1 0 2 0.9162907318741551
à r 0 0 1 1.6094379124341003
á r 0 0 2 0.9162907318741551
a r 0 0 5 0.0
a r 0 1 5 0.0
á r 1 0 1 1.6094379124341003
a r 1 0 5 0.0
a r 1 1 1 1.6094379124341003
ã s 0 0 1 1.6094379124341003
á s 0 0 2 0.9162907318741551
a s 0 0 5 0.0
â s 0 1 1 1.6094379124341003
ã s 0 1 1 1.6094379124341003
á s 0 1 2 0.9162907318741551
a s 0 1 5 0.0
á s 1 0 1 1.6094379124341003
a s 1 0 5 0.0
á s 1 1 1 1.6094379124341003
a s 1 1 3 0.5108256237659907
á t 0 0 1 1.6094379124341003
ä t 0 0 1 1.6094379124341003
â t 0 0 2 0.9162907318741551
a t 0 0 5 0.0
â t 0 1 1 1.6094379124341003
a t 0 1 3 0.5108256237659907
á t 1 0 1 1.6094379124341003
a t 1 0 5 0.0
a t 1 1 1 1.6094379124341003
a ú 0 0 2 0.9162907318741551
á u 0 0 2 0.9162907318741551
a u 0 0 5 0.0
á u 0 1 1 1.6094379124341003
a u 0 1 4 0.22314355131420976
a ú 1 0 1 1.6094379124341003
a u 1 0 5 0.0
a u 1 1 1 1.6094379124341003
á v 0 0 1 1.6094379124341003
a v 0 0 5 0.0
á v 1 0 1 1.6094379124341003
a v 1 0 5 0.0
a w 0 0 2 0.9162907318741551
a w 0 1 1 1.6094379124341003
a w 1 0 1 1.6094379124341003
a w 1 1 1 1.6094379124341003
a x 0 0 3 0.5108256237659907
a x 0 1 1 1.6094379124341003
a x 1 0 1 1.6094379124341003
á y 0 0 1 1.6094379124341003
a y 0 0 5 0.0
a y 0 1 4 0.22314355131420976
a y 1 0 4 0.22314355131420976
á z 0 0 1 1.6094379124341003
a z 0 0 5 0.0
a z 0 1 2 0.9162907318741551
a z 1 0 3 0.5108256237659907

Column legend:

  • c1: first character of the bigram
  • c2: second character of the bigram
  • m1: indicates whether the bigram is at the left margin
  • m2: indicates whether the bigram is at the right margin
  • df: document frequency
  • idf: inverse document frequency
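
The idf values in the table agree with idf = ln(N / df) for N = 5 documents (one per language): df = 5 gives 0.0 and df = 1 gives ln 5 ≈ 1.6094. The sketch below shows one way such rows could be produced. It is not the repository's code. The names `word_bigrams` and `bigram_idf` are hypothetical, and the margin rule is an assumption read off the table: m1 = 1 when the bigram starts a word, m2 = 1 when it ends a word, and a one-character word yields a single row with an empty c2 (shown as "nothing" in the table) and both flags set.

```python
import math
from collections import defaultdict


def word_bigrams(word):
    """Yield (c1, c2, m1, m2) for each character bigram in `word`.

    Assumed rule, inferred from the table and not confirmed by the source:
    m1 is 1 when the bigram starts at the first character of the word,
    m2 is 1 when it ends at the last character. A one-character word
    yields one row with c2 = None and both flags set.
    """
    if len(word) == 1:
        yield (word, None, 1, 1)
        return
    last = len(word) - 2  # index of the final bigram in the word
    for i in range(len(word) - 1):
        yield (word[i], word[i + 1], int(i == 0), int(i == last))


def bigram_idf(corpus):
    """Map each bigram row to (df, idf).

    `corpus` is a dict: document name -> iterable of words.
    df counts the documents containing the row; idf = ln(N / df).
    """
    docs_with = defaultdict(set)
    for doc, words in corpus.items():
        for word in words:
            for row in word_bigrams(word):
                docs_with[row].add(doc)
    n = len(corpus)
    return {
        row: (len(docs), math.log(n / len(docs)))
        for row, docs in docs_with.items()
    }


# Toy two-document corpus (made up for illustration, not the real data).
toy = {"en": ["a", "at"], "pt": ["a", "lã"]}
for (c1, c2, m1, m2), (df, idf) in sorted(bigram_idf(toy).items(), key=str):
    print(c1, c2, m1, m2, df, idf)
```

With the five-language corpus, N would be 5, so a row seen in every language gets idf 0.0 and a row seen in only one language gets the maximum value, ln 5.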

from lincom.

fabiolimace commented on July 17, 2024

Task completed.

The artifacts are included in the lab directory of this repository:

https://github.com/fabiolimace/lincom/tree/main/lab/002_listar_bigramas_caracteres

from lincom.
