Quebec’s national library is moving forward with plans to create a database of cultural and government content that could be used to train artificial intelligence systems and improve their understanding of Quebec society, culture and Indigenous languages.
Bibliothèque et Archives nationales du Québec, or BAnQ, the province’s national library and archives institution, has launched the experimental phase of its proposed government and cultural databank in French and Indigenous languages after completing a feasibility study earlier this year.
The project aims to address concerns that major generative AI systems often struggle to provide reliable information about Quebec society, economy and culture because of the limited amount of Quebec-related data available to them.
“All scenarios are a little bit on the table right now,” Valérie D’Amour, who led the feasibility study, said in an interview. “We have a lot of ideas and we want to validate the possibilities with cultural stakeholders, as well as with data owners and providers, who will be involved in the discussions.”
BAnQ says the future platform would not serve as a public distribution channel for creative works and that access to the data would be tightly controlled.
Marie Grégoire, president and chief executive officer of BAnQ, said the goal is to ensure that AI systems better reflect Quebec society and culture.
“That means having Quebec references, whether in small models or large models, whether they come from research or from the business community,” she said.
Similar initiatives have emerged elsewhere, including in Sweden, where large collections of Nordic-language texts have been assembled to help develop generative AI models for Scandinavian languages.
BAnQ plans to begin with its own collections before considering data from other sources.
The initiative stems from a recommendation made in a 2024 report by Quebec’s innovation council. The report attributed the problem in part to the “very small amount of data on Quebec” available in AI training datasets.
Destiny Tchéhouali, co-holder of a Quebec-based research chair focused on French-language artificial intelligence and digital technologies, said Quebec culture remains “underrepresented in the corpora currently circulating in the AI world.”
“And we run the risk of reproducing linguistic biases and cultural biases. And when we also talk about Indigenous peoples, we run an even greater risk of all these biases,” said Tchéhouali, a professor in the communications department at Université du Québec à Montréal.
He said the proposed database would represent “strategic infrastructure” that could help establish guidelines for how local content is identified, catalogued and tracked within today’s AI systems.
Copyright concerns have emerged as a major issue for the cultural sector as BAnQ develops the proposed database.
But Grégoire argued the proposed platform could offer creators greater protection than the current system. “Right now, it’s a bit like the Wild West,” she said. “Data is being harvested for free, and that shouldn’t be the case.”
She said the database could act as a centralized gateway that would make it easier to compensate creators whose works are used.
Grégoire said that by working together, cultural organizations would be better positioned to ensure creators are paid and that the sector remains sustainable over the long term.
Still, some artists worry that contributing their work to AI training systems could ultimately undermine their own livelihoods.
“The main criticism we hear in the field is that, even if artists earn income from it, they’re still feeding the beast that will ultimately be used to replace contracts they might lose because of AI,” said Maxime Harvey, a postdoctoral researcher at the National Institute of Scientific Research and a member of the same research chair.
The feasibility study envisions the platform becoming operational by 2029, although D’Amour said the timeline will be reassessed following the experimental phase.
The study estimates a five-year budget of nearly $10.5 million through 2030, including operating and capital costs. BAnQ has received $340,000 from the Quebec government for the feasibility study and a further $750,000 to support the project’s 12-month experimentation phase.
–This report by La Presse Canadienne was translated by CityNews




