Day #7 Ingressive For Good 10 Days Codding Challenge Experience

Sep 27, 2022
0 Comments
429 views
0 favorites

The seventh day of the Algorithm implementation coding challenge for Ingressive for Good #I4G10DaysOfCodeChallenge, experience reflection.

Today's challenge was about checking whether the collection of integers forms a valid UTF8 string or not.

The challenge title was "UTF-8 Validation." Let me give you all the question descriptions so you can understand it better. If you need more description about the challenge, you can find more detail here.

Given an integer array data representing the data, return whether it is a valid UTF-8 encoding (i.e. it translates to a sequence of valid UTF-8 encoded characters).

A character in UTF8 can be from 1 to 4 bytes long, subjected to the following rules:

For a 1-byte character, the first bit is a 0, followed by its Unicode code.
For an n-bytes character, the first n bits are all one's, the n + 1 bit is 0, followed by n - 1 bytes with the most significant 2 bits being 10.

This is how the UTF-8 encoding would work:

     Number of Bytes   |   UTF-8 Octet Sequence (binary)
   --------------------+-----------------------------------------
            1          |   0xxxxxxx
            2          |   110xxxxx 10xxxxxx
            3          |   1110xxxx 10xxxxxx 10xxxxxx
            4          |   11110xxx 10xxxxxx 10xxxxxx 10xxxxxx

x denotes a bit in the binary form of a byte that may be either 0 or 1.

Note: The input is an array of integers. Only the least significant 8 bits of each integer is used to store the data. This means each integer represents only 1 byte of data.

The Solution

I took the simplest approach to solve the challenge, but I believe it can be improved more. What I did was straightforward. Here is the algorithm:

Loop over each element of the list

Initiate an iterator variable from zero
Get the element on the iterator^th position of the list
Convert it to a string representation of its binary code
Count the number of 1s in front
Check if all the elements represented by the number of 1s in the first element all start by 10
1. If yes, advance to the (iterator + number of 1s)^th element and repeat the steps 2 - 5
2. If not, return False