Text and Data Mining
Last Updated: April 2026
ETFLIN is committed to advancing scientific knowledge through automated insights and standardized metadata harvesting. By providing robust support for Text and Data Mining (TDM) and the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH), we enable researchers to unlock patterns and trends across large volumes of research content.
Text and Data Mining (TDM)
Text and Data Mining involves the use of automated algorithms to analyze large datasets, allowing for the rapid discovery of valuable insights that might be missed through manual reading. ETFLIN facilitates this by ensuring our content is machine-readable and structured for computational analysis.
Our TDM usage is governed by the following conditions:
Non-Commercial Use: TDM activities are permitted strictly for research and educational purposes. Usage for commercial applications or for-profit ventures requires explicit written permission from ETFLIN.
Proper Attribution: All research findings or datasets derived from TDM must credit the original authors and the specific ETFLIN source publication accurately.
Integrity and Ethics: The underlying content must not be altered during the mining process. Furthermore, researchers must comply with global data privacy and confidentiality regulations when processing results.
Secure Archiving: We maintain our articles in accessible, standardized formats (such as XML and PDF) to provide a reliable long-term foundation for automated analysis.
Metadata Harvesting via OAI-PMH
ETFLIN fully supports the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH). This standardized protocol allows library systems, search engines, and research databases to efficiently "harvest" metadata, such as article titles, abstracts, author names, and publication dates, across our entire portfolio.
By implementing OAI-PMH, we ensure that research published with ETFLIN is highly discoverable and easily integrated into global scholarly networks. This increases the visibility of our authors' work and facilitates cross-disciplinary meta-analyses.
Technical Access: Each ETFLIN journal provides a dedicated OAI endpoint. For example, researchers interested in the Sciences of Pharmacy journal can access the metadata feed at:
https://etflin.com/sciphar/oai
Rate Limiting and System Fair Use
To ensure the stability and performance of our digital infrastructure for all users, we implement a Fair Use Policy regarding automated access. While we encourage the use of TDM and OAI-PMH, we ask that researchers and automated harvesters observe the following technical etiquette:
Request Throttling: Please configure your scripts to include reasonable delays between requests. High-frequency automated "scraping" that mimics a Distributed Denial of Service (DDoS) attack may result in temporary IP suspension to protect server integrity.
User-Agent Identification: We recommend that automated tools include a clear "User-Agent" header in their requests, providing a contact email or project description. This allows our technical team to reach out and provide assistance or dedicated access if your research requires high-volume data retrieval.
API Usage: Where available, we encourage using our official API endpoints rather than direct web scraping, as this provides a more stable and efficient data transfer method for both parties.
Continuous Evolution of Standards
ETFLIN periodically reviews these technical policies to remain aligned with evolving best practices in open-access publishing and computational research. We are dedicated to maintaining a balance between high-speed data accessibility and the long-term security of our digital archives, ensuring that the version of record remains protected while being fully available for the next generation of scientific discovery.