Preferred data formats
The choice of data format matters because it determines whether the data will remain readable in the future. Some formats offer significantly better long-term usability than others.
Properties of preferred data formats:
- Non-commercial: freely available and usable without the need to buy specific software or licences. This ensures wider access and long-term preservation of data, regardless of changes in the activities of commercial enterprises.
- Open, with documented international standards: based on publicly available and standardised technical specifications that allow different systems and tools to process the data without restriction.
- Standard character encoding: ensures correct display of text across languages and platforms and avoids encoding incompatibility issues. Unicode UTF-8 is a widely used character encoding standard that allows the uniform representation and exchange of text across languages.
- Uncompressed: avoids possible data corruption and dependency on specific compression methods. Uncompressed data also facilitates processing and long-term preservation, as no additional software is needed to open or restore the data.
These features help ensure that data is easily accessible, securely stored and widely usable in the future.
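The character-encoding property can be checked before data is deposited. The following is a minimal sketch, assuming Python is available; the function names `is_valid_utf8` and `convert_to_utf8` are illustrative and not part of any repository tooling. The conversion only gives correct results if the original encoding is actually known.

```python
from pathlib import Path


def is_valid_utf8(path):
    """Return True if the file's bytes decode cleanly as UTF-8."""
    try:
        Path(path).read_bytes().decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


def convert_to_utf8(src, dst, source_encoding):
    """Re-encode a text file from a known legacy encoding to UTF-8.

    Works on raw bytes so that line endings are left unchanged.
    source_encoding is an assumption supplied by the caller; it is
    not detected automatically.
    """
    text = Path(src).read_bytes().decode(source_encoding)
    Path(dst).write_bytes(text.encode("utf-8"))
```

For example, a file known to be saved as Latin-1 could be converted with `convert_to_utf8("notes.txt", "notes_utf8.txt", "latin-1")` and then confirmed with `is_valid_utf8("notes_utf8.txt")`. Note that a file containing only ASCII characters also passes the check, since ASCII is a subset of UTF-8.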
| File type | Preferred formats | Acceptable formats | Non-preferred formats |
|---|---|---|---|
| Text documents | | | |
| Plain text | Unicode text (.txt) | | |
| Presentations | PDF/A (.pdf) | | |
| Data tables | | | |
| Databases | | | |
| Statistical analysis data | | | |
| Audio | | | |
| Video | | | WMV (.wmv) |
| Images | | | |
| Vector files | | EPS (.eps) | |
| Geographical information systems (GIS) | | | |
| Archives | | | |
| Qualitative data analysis | | | |