Compression algorithms based on bit-strings that are very simple to implement?

Ask Question

Asked 4 years, 1 month ago

Modified 4 years, 1 month ago

Viewed 107 times

Given the type of bitstrings in Haskell:

data Bits = O Bits | I Bits | Nil

Are there compression algorithms that have satisfactory results (specially when it comes to repeated substrings), that are very small and simple to implement as idiomatic recursive Haskell functions? Ideally, the algorithm would operate on bitstrings, not lists of characters; thus, it should identify repeated substrings of arbitrary lengths and positions. I'm asking for references (i.e., names/citations); no need to post the complete code.

asked Oct 7, 2021 at 19:36

MaiaVictor

53.4k47 gold badges161 silver badges309 bronze badges

1

To whoever voted to close: note that I'm not asking for software or library recommendations, I'm asking to name algorithms. Is that against the guidelines?

MaiaVictor
– MaiaVictor

2021-10-07 20:06:21 +00:00
Commented Oct 7, 2021 at 20:06
I think the question is a bit opinion-based: what does it mean for an algorithm to be "simple" and "idiomatic", and what are "satisfactory results"?

Noughtmare
– Noughtmare

2021-10-07 21:04:11 +00:00
Commented Oct 7, 2021 at 21:04
Also, do you mean consecutive repetitions or duplicated substrings in general?

Noughtmare
– Noughtmare

2021-10-07 21:26:27 +00:00
Commented Oct 7, 2021 at 21:26
@Noughtmare I mean an algorithm that can perhaps be implemented in a few recursive functions without many dependencies using mostly pattern-matching. But how else can I define that? If I just ask for a compression algorithm in general, there are many that I can easily find on Google, so this question wouldn't be needed. But I really need one that can be implemented in a few lines of normal Haskell. And I mean duplicated substrings in general, basically I want it to be able to detect repeated words and compress them.

MaiaVictor
– MaiaVictor

2021-10-08 01:04:34 +00:00
Commented Oct 8, 2021 at 1:04
Run-length encoding is very easy to define, and works well if you have long repeating strings. (en.wikipedia.org/wiki/Run-length_encoding)

alias
– alias

2021-10-08 02:02:09 +00:00
Commented Oct 8, 2021 at 2:02

Add a comment |

0 Your Answer

Sign up or log in

Post as a guest

Name

Required, but never shown

Post as a guest

Name

Required, but never shown

By clicking “Post Your Answer”, you agree to our terms of service and acknowledge you have read our privacy policy.

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.

Collectives™ on Stack Overflow

Compression algorithms based on bit-strings that are very simple to implement?

0

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

0

Know someone who can answer? Share a link to this question via email, Twitter, or Facebook.

Your Answer

Sign up or log in

Post as a guest