Files
drc-ners-nlp/notebooks
Amaury Cansa 4874b178c9 Name Analysis (#9)
* feat: implement representative sampling by province (~500k records), extract surnames from the first token of name, build letter transition matrices (frequency and probability), add heatmap visualization for transitions, and integrate a Markov chain–based name generator.

* Implemented letter frequency analysis with histograms, computed bigram and trigram frequencies, and displayed the top results in tabular format. Rebuilt the transition probability matrix, and developed a name generator capable of producing realistic outputs based on surname data.
2025-09-24 20:23:40 +02:00
..
2025-09-21 16:23:44 +02:00
2025-09-24 20:23:40 +02:00
2025-09-24 20:23:40 +02:00
2025-09-21 16:23:44 +02:00
2025-09-21 16:23:44 +02:00