Release of FAPEC Archiver 23.0 and WinFAPEC 1.0

FAPEC 23 logoicon

We have just released the new version of our professional data compression software, FAPEC Archiver 23.0.

For the first time, we are also including a graphical front-end: WinFAPEC 1.0, for Microsoft Windows.

As in previous versions, the software can be downloaded from our website, at this link. Without any cost, it will provide you free decompression features (yet with some limitations) of .fapec files, both in command-line and with WinFAPEC (for Windows platforms). The command-line compressor and decompressor is available for Windows, macOS (x86_64 and M1 processors) and Linux (x86_64 and ARM processors).

To compress files (either through the command-line or with WinFAPEC), you will need a valid FAPEC license. Once again, you can get free 30-days evaluation licenses from our website. For full licenses, integration in your systems and support, please contact us.

These are some snapshots of WinFAPEC 1.0:

 

As usual, FAPEC Archiver (and now also WinFAPEC) includes our professional technology:

  • High-performance, multi-threaded, versatile two-stage data compression.
  • Automatic algorithm selection from a quick analysis of each file to be compressed.
  • Professional stages: Kongsberg’s MBES data, CILLIC image and video compression, FastQ genomics compression, tabular (CSV) data…
  • Ultra-fast compression for speeds in the GB/s regime.
  • Lossy compression options for most of the stages, configurable by the user (lossless is set by default).
  • Improvements in the Wave compression of multi-channel time series, including a “smart lossy” algorithm. As shown in this paper, it allows to achieve ratios as high as 100 on digital RF data with error-free demodulation.
  • On-the-fly data analysis, allowing to obtain basic statistics from each file. It includes, for example, a “flatness map” of images (with CILLIC), or the compressibility of MBES watercolumn datagrams.
  • On-the-fly encryption with XXTEA (and AES-256 for full licenses). Error recovery from corrupted files.
  • APIs for third-party integration in C/C++, Python, Java, HDF5, FITS…

Have fun!

Release of FAPEC Archiver 22.0 – codename “Merganser”

 

Today, DAPCOM has released the new version of our propietary, high-performance, professional, staged data compressor: FAPEC Archiver 22.0 (codename Merganser).

Anybody can request and download free evaluation licenses, and for the first time, free perpetual decompressors can be downloaded.

It includes several exciting features:

  • New professional stage to compress Kongsberg’s MBES data, including their brand new KMALL (and KMWCD) files, as well as .all and .wcd files.
    • FAPEC achieves the best compression ratios in the market for those files.
    • You can find more information in our brochure.
  • New image compression algorithm, CILLIC (Context-Interpolation Lossless and Lossy Image Compressor), supporting greyscale, colour and hyperspectral images in various coding formats (including Bayer pattern). It offers lossless, near-lossless and lossy options.
  • New compression algorithm for multi-channel time series, Wave, offering lossless and near-lossless options. It is ideal for audio and radio-frequency (RF) data, for example.
  • New ultra-fast compression core, FASEC, for extremely fast lossless compression of single- or multi-dimensional time series or images. It can achieve compression (and decompression) speeds well above 1GB/s on a modest laptop.
  • Our usual professional stages: FastQ (genomics data), Tabular text data (such as CSV or some LIDAR and point cloud formats), LZW and improved FAPECLZ for text data, etc.
  • On-the-fly generation of basic compression statistics for each data chunk and file, which can be extended to perform quick statistical analyses on the data.
  • Multiple file and directories archival (up to 8 million files or folders), keeping dates and permissions.
  • Multi-threaded operation.
  • AES256 and XXTEA-based encryption.
  • Public API to integrate FAPEC compression or decompression in your C/C++ software.
  • Python and Java bindings, to programmatically use FAPEC from your Python or Java code.
  • Integration in HDF5 and FITS.

The codename Merganser comes from the red-breasted merganser, which is claimed to be the fastest duck. We chose a duck name due to the release number and date (2022-02-22), which contain a lot of ducks.

You can get your personal copy of FAPEC here, where you can choose between a 30-day evaluation license to test the compressor, or the free decompression program.

You can find more information on FAPEC here.

First Earth Observation images compressed by the new FAPEC-CILLIC algorithm onboard ESA’s OPS-SAT

The European Space Agency (ESA) launched OPS-SAT on 18 December 2019 into a circular, polar orbit at 515 km altitude.
OPS-SAT is a technology demonstration nanosatellite based on the cubesat standard. The satellite is only 30cm x 10cm x 10cm, but it brings powerful technology and instrumentation onboard, such as a fine Attitude Determination Control System, a GPS receiver, an X-band transmitter (up to 50 Mbps downlink) or a Software Defined Radio front-end.
It also includes a high-performance data processing platform (based on a dual-core ARM Cortex-A9 processor) and a high-resolution camera (based on an RGB Bayer filter) with a bit depth of 12 bits per pixel and a ground resolution of up to 80m x 80m per pixel.

When preparing for this mission, ESA issued a call for experimenters willing to test their technologies in orbit. One of the submitted (and accepted) experiments is the FAPEC data compression software – a versatile and efficient solution for lossless and lossy compression of images and of a large variety of instrumental data.
FAPEC is already being used by a satellite constellation to compress payload data onboard. However, its image compression capabilities had not been demonstrated in orbit yet. Furthermore, FAPEC was recently improved with a new image decorrelation algorithm (named CILLIC), presented at the ESA/CNES On-Board Payload Data Compression workshop (OBPDC) on 22 September 2020. FAPEC, with its CILLIC configuration, offers lossless and lossy image compression performances similar to those achieved by wavelet transforms but at a fraction of their computational cost.

On 28 November 2020, FAPEC was uploaded to OPS-SAT. The day after, and for the very first time in orbit, it was invoked by the payload data processing system to compress two images of our planet taken from Space.
Each of the two images, with a resolution of 4 Mpix and weighting 7.6 MB, were reduced to nearly 800 KB in about 0.8 seconds in a single computing thread.
Once received on ground, they were “de-bayered” with the ImageJ software to obtain the original colour images, which are shown below.

First FAPEC-compressed images from ESA’s OPS-SAT. The colour pictures shown here have been obtained by applying a standard de-bayer algorithm on ground, without any colour correction or white balancing in this case.
Photos: ESA/OPS-SAT

 

Shortly after this, more images were acquired and, again, compressed with FAPEC to be later downlinked to the ground station. These ones better illustrate the high resolution of the camera:

Some of the Earth Observation images acquired by ESA’s OPS-SAT and compressed by FAPEC onboard. In this case, some colour correction (white balancing) was applied.
Photos: ESA/OPS-SAT

 

With this, FAPEC has been the first user non-ESA experiment to be activated onboard OPS-SAT, achieving two important goals. First, it demonstrates the feasibility of uploading and activating new experiments onboard this technology demonstration mission. And second, thanks to the efficient collaboration with ESA and OPS-SAT experts, it can provide an interesting data compression service to other experimenters having to download many images from the satellite.

FAPEC is an excellent data compression solution especially for cubesat-based missions, which use to have strong limitations in onboard computing capabilities and in the downlink. The highly optimized, versatile and portable FAPEC software allows for an agile integration in the payload data handling system while offering a compression throughput comparable to hardware-based solutions. Even satellites with just a low-range onboard computer can use it. Science return of EO missions can significantly be increased in this way.

More information:

Disclaimer: The view expressed herein can in no way be taken to reflect the official opinion of the European Space Agency.

Gaia EDR3 bulk catalogue available in FAPEC format

The Gaia group at the University of Barcelona (IEECICCUB), in cooperation with DAPCOM, has published an alternative copy of the bulk data files from Gaia EDR3 – the Early Data Release 3 from Gaia.

Gaia EDR3 was published yesterday, 3rd December 2020. Besides the on-line catalogue, bulk CSV files were also made available for download – an interesting option for exhaustive analyses. Such files are officially offered in “csv.gz” format, that is, compressed with the widely known gzip compressor.

On 6 February 2019, we released FAPEC Archiver 19.0, our professional data compression software offering high compression ratios at high speeds. One of the options provided is the compression of tabular (CSV-like) text files, such as those from the bulk Gaia EDR3. As a service to the worldwide astronomical community, and also as a demonstration of the capabilities of FAPEC, DAPCOM and the Gaia IEEC/ICCUB Group converted the GaiaSource files from the official Gaia EDR3 bulk CSV repository into the FAPEC format, reducing the total size from 613 GB to 495 GB – that is, 19% smaller than with gzip. Other data compressors like bzip2, rar, Zstandard or 7-zip cannot reach this mark.

You can now download Gaia EDR3 in csv.fapec format here:

     Gaia EDR3 csv.fapec bulk download

The additional tables available in the bulk Gaia EDR3 catalogue will also be converted and published during the coming days.

Free FAPEC decompression licenses can be obtained from our website. Besides, we are preparing a new FAPEC release, including a freely downloadable decompressor with Python bindings.

Have fun!

Gaia DR2 bulk catalogue available in FAPEC format

The Gaia group at the Universitat de Barcelona (IEECICCUB), in cooperation with DAPCOM, has published an alternative copy of the bulk data files from Gaia DR2 – the second data release from Gaia, where DAPCOM has made significant contributions.

Gaia DR2 was published on 25 April 2018. Besides the on-line catalogue, bulk CSV files were also made available for download – an interesting option for exhaustive analyses. Such files are officially offered in “csv.gz” format, that is, compressed with the widely known gzip compressor.

On 6 February 2019, we released FAPEC Archiver 19.0, our professional data compression software offering high compression ratios at high speeds. One of the options provided is the compression of tabular (CSV-like) text files, such as those from the bulk Gaia DR2. As a demonstration of the capacities of FAPEC, we converted the full Gaia DR2 bulk CSV files to the FAPEC format, reducing the total size from 554 GB to 471 GB – that is, 15% smaller than with gzip. Other data compressors like bzip2, rar, Zstandard or 7-zip cannot reach this mark. Specifically, for the largest tables:

  • gaia_source has been reduced from 548 GB to 466 GB. We have also combined several CSV files into larger FAPEC archives to improve download transfer speeds.
  • gaia_source_with_rv, from 3.1 GB to 2.5 GB.
  • light_curves, from 2.3 GB to 1.9 GB.

You can now download Gaia DR2 in csv.fapec format here:

Gaia DR2 csv.fapec bulk download

There you will also find the scripts used for the gzip-to-fapec conversion, as well as the log files from the process, during which we checked each of the files to make sure no data was lost or corrupted.

Free FAPEC decompression licenses can now be obtained from our website.

Have fun!

 

Release of FAPEC Archiver 19.0

 

FAPEC

FAPEC Archiver 19.0 is out!

Today, DAPCOM has released the new version of our propietary, high-performance, professional, staged data compressor, FAPEC.

This version, called FAPEC Archiver 19.0, is the first public version in the sense that anybody can request and download free decompression or evaluation licenses.

It also includes some exciting improvements with respect to the previous release, such as:

  • New professional stages: FastQ (genomics data), Tabular text data (such as CSV or some LIDAR and point cloud formats), Kongsberg’s water column data.
  • LZW stage and improved FAPECLZ stage for text data, offering excellent ratios and outstanding decompression speeds on log files.
  • On-the-fly generation of basic compression statistics for each data chunk and file, which can be extended to perform quick statistical analyses on the data.
  • Multiple file and directories archival (up to 8 million files or folders), keeping dates and permissions.
  • Multi-threaded operation.
  • AES256 and XXTEA-based encryption.
  • Public API to integrate FAPEC compression or decompression in your software, available in C for now (Java/JNI and Python bindings are in the making).

Get your personal FAPEC copy here!

Gaia Data Release 2 and DAPCOM

On 25 April 2018 at 12:00 CEST, the second Gaia data release (Gaia DR2) was published.
This is a major milestone in astronomy, leading to the largest and most precise multi-dimensional map of our Galaxy: it provides positions and brightness of 1.7 billion stars (also providing distances, proper motions and colours for 80% of these), as well as 7 million stars with radial velocities, 550 thousand variable stars, 14 thousand asteroids and millions of astrophysical parameters.
The release attracted a lot of attention from press and media all over the world. In the three weeks since this publication, nearly a hundred scientific papers have been prepared for this release or using data from it. Impact in practically all aspects of astronomy is out of doubt.

DAPCOM, alumni of the ESA Business Incubation Centre (BIC) of Barcelona, has significantly contributed to this groundbreaking dataset through a contract awarded by ESA in 2015.
The so-called Cross-Match process, an essential element in the Gaia Data Processing and Analysis Consortium (DPAC), had to process over 50 billion observations (acquired during the first 22 months of the mission), reliably identifying the clusters corresponding to a same source – be it a well-behaved isolated star, a dense area in the sky, or a star with high proper motion.
Our experts have designed, implemented and operated all stages of this complex process (executed at the MareNostrum supercomputer), from the identification and filtering of spurious or parasitic detections to the final resolution based on clustering techniques. Specifically, we have adapted the recursive nearest-neighbour algorithm to properly identify the objects observed by Gaia, which do not necessarily follow a first-order rectilinear motion. One of our most remarkable contributions is the design, implementation and tuning of an adhoc decision and resolution tree. Its result is, in short, the definition of the list and features of the sources contained in the data release.
This work is still ongoing. DAPCOM is further improving and executing this cross-match process, now handling 34 months of data, aiming at the preparation of the third Gaia data release, envisaged for end 2020.

Gaia’s sky in colour

DAPCOM on TV (La 2, “Tinc una idea”)

DAPCOM appeared yesterday on the TV program “Tinc una idea” (“I’ve got an idea”) of La 2 (around minute 21:55), in catalan.

Watch it online

New FAPEC stage for tabulated text data compression

DAPCOM is about to release a new version of the FAPEC data compression software. Among its exciting new features we can find a new stage for tabulated text data, such as point clouds or CSV files. We have done some tests which reveal that FAPEC achieves the best compression ratios at a very low computing cost!

Lossless data compression ratios on tabulated text data

Lossless data compression speeds on tabulated text data