"First Missing Positive" Problem: Everything You Need to Know - Code with Kamlesh-Let's Learn Programming

The “First Missing Positive” problem is a common algorithmic challenge often posed in coding interviews and technical assessments. This problem, while seemingly simple, requires a deep understanding of array manipulation, in-place sorting techniques, and efficient algorithm design to solve it optimally. In this comprehensive guide, we’ll explore the problem in detail, discuss its significance, and walk through various approaches to solving it, with a focus on finding the most efficient solution.

Table of Contents

1. Understanding the “First Missing Positive” Problem

The problem statement is as follows: Given an unsorted array of integers, find the smallest missing positive integer. The positive integer must be greater than zero, and the task is to find the first number missing in the natural number sequence starting from 1.

Examples:

Example 1:
Input: [1, 2, 0]
Output: 3
Explanation: The sequence should be 1, 2, 3, but 3 is missing.
Example 2:
Input: [3, 4, -1, 1]
Output: 2
Explanation: The sequence should be 1, 2, 3, 4, but 2 is missing.
Example 3:
Input: [7, 8, 9, 11, 12]
Output: 1
Explanation: The sequence should start with 1, which is missing.

Problem Constraints:

The solution must have a time complexity of O(n).
The space complexity should be O(1), meaning no extra array or list can be used.

2. Significance of the Problem

This problem is an excellent example of how algorithm design principles can be applied to create efficient solutions. It tests your ability to:

Optimize algorithms to run in linear time.
Use space efficiently by manipulating the input array in place.
Handle edge cases, such as arrays with no positive integers, arrays with duplicates, and arrays with all positive numbers.

The “First Missing Positive” problem is particularly important in interviews because it requires a combination of logical thinking, problem-solving skills, and a strong understanding of array operations.

3. Challenges in Solving the Problem

While the problem may appear straightforward, several challenges must be addressed to solve it optimally:

Handling Negative Numbers and Zeros: These should be ignored, as they don’t contribute to finding the smallest positive integer.
Identifying the Missing Number Efficiently: The brute-force approach of checking every number in the natural sequence is too slow and doesn’t meet the O(n) time complexity requirement.
Working Within Space Constraints: You cannot use extra space, such as additional arrays or hash tables, to track missing numbers, making it necessary to manipulate the input array directly.

4. Approaches to Solving the Problem

Let’s explore various approaches, leading up to the optimal solution.

a. Brute Force Approach

The most basic approach is to sort the array and then iterate through it to find the first missing positive number. However, this approach is inefficient due to its O(n log n) time complexity resulting from the sorting step. Additionally, it doesn’t meet the space constraints if additional storage is used.

b. Hash Set Approach

Another approach is to use a hash set to store all positive integers from the array. Then, iterate through the natural numbers starting from 1 and check which one is missing by querying the hash set. This approach has O(n) time complexity but fails the space requirement due to the use of extra space for the hash set.

c. Optimal In-Place Swap Approach

The optimal solution is based on rearranging the array such that each positive integer x (where 1 ≤ x ≤ n) is placed at the index x-1. This can be achieved through in-place swapping. After rearranging, the first index that doesn’t hold the correct number indicates the smallest missing positive integer.

Detailed Steps:

Segregate Positive and Non-Positive Numbers:

First, move all non-positive numbers to one side of the array. This step ensures that only positive numbers are considered.

In-Place Rearrangement:

For each positive number x in the array, place it at the index x-1 if possible. This step ensures that if a number x is present, it is placed at its correct index position.

Identify the Missing Positive:

After rearranging, iterate through the array. The first index i where nums[i] != i + 1 gives the smallest missing positive integer, which is i + 1.

Python Implementation:

Here’s a Python implementation of the in-place rearrangement approach:

def first_missing_positive(nums):
    n = len(nums)

    # Rearrange the array in-place
    for i in range(n):
        while 1 <= nums[i] <= n and nums[nums[i] - 1] != nums[i]:
            nums[nums[i] - 1], nums[i] = nums[i], nums[nums[i] - 1]

    # Identify the first missing positive
    for i in range(n):
        if nums[i] != i + 1:
            return i + 1

    return n + 1

Explanation:

In-Place Rearrangement: This loop ensures that each number x is placed at the index x-1 if it’s within the array’s bounds.
Final Check: The final loop identifies the first index that doesn’t match the expected value. If all positions are correct, the smallest missing positive integer is n + 1.

5. Edge Cases and Considerations

When implementing the solution, it’s important to consider edge cases:

Empty Array: If the array is empty, the smallest missing positive integer is 1.
Array with Only Non-Positive Numbers: If the array contains only zeros or negative numbers, the smallest missing positive is 1.
Array with All Positive Integers in Sequence: If the array contains all consecutive positive integers starting from 1, the answer is n + 1, where n is the size of the array.

6. Why This Solution is Optimal

This solution is optimal for several reasons:

Time Complexity: It runs in O(n) time, as it only makes a few passes through the array.
Space Complexity: It uses O(1) extra space, as the rearrangement is done in place without using additional data structures.
Simplicity and Elegance: The approach is simple yet powerful, leveraging array indices to track the presence of numbers efficiently.

7. Conclusion

The “First Missing Positive” problem is a great example of how careful algorithm design can lead to efficient solutions. By understanding the constraints and leveraging in-place array manipulation, we can solve the problem in optimal time and space complexity. This problem not only tests your understanding of arrays and sorting but also challenges you to think critically about how to achieve the best possible performance with minimal resources.

Whether you’re preparing for a coding interview or looking to sharpen your algorithmic skills, mastering the “First Missing Positive” problem is an excellent step toward becoming a more proficient problem solver. Remember, the key to success lies in understanding the problem deeply, considering various approaches, and refining your solution to meet the constraints efficiently.

FAQs: First Missing Positive

1. What is the “First Missing Positive” problem?
The “First Missing Positive” problem is a common algorithmic challenge where you are given an unsorted array of integers, and the task is to find the smallest missing positive integer. The solution should be efficient, with a time complexity of O(n) and a space complexity of O(1).

2. Why is the “First Missing Positive” problem important?
This problem is important because it tests your ability to optimize both time and space complexity in algorithms. It’s commonly used in coding interviews to assess a candidate’s problem-solving skills, particularly their ability to handle edge cases and work within strict constraints.

3. What are the main challenges in solving the “First Missing Positive” problem?
The main challenges include handling negative numbers and zeros, identifying the missing positive number efficiently without sorting, and working within the space constraint of O(1), which means no extra arrays or hash tables can be used.

4. What is the brute-force approach to solving this problem?
The brute-force approach involves sorting the array and then iterating through it to find the first missing positive integer. However, this method is inefficient with a time complexity of O(n log n) due to sorting, and it doesn’t meet the optimal space complexity requirement.

5. How does the optimal in-place swap approach work?
The optimal approach involves rearranging the array such that each positive integer x is placed at index x-1. After rearranging, you iterate through the array to find the first index where the number doesn’t match its position. This index gives the smallest missing positive integer.

6. What is the time complexity of the optimal solution?
The time complexity of the optimal solution is O(n), as the algorithm only makes a few passes through the array.

7. What is the space complexity of the optimal solution?
The space complexity of the optimal solution is O(1), as the array is modified in place without using any additional data structures.

8. What should the solution return if the array is empty?
If the array is empty, the solution should return 1 as the first missing positive integer.

9. How does the solution handle arrays with only non-positive numbers?
If the array contains only non-positive numbers (zeros or negative numbers), the solution will return 1 as the first missing positive integer since 1 is the smallest positive integer and is missing.

10. What if all positive integers from 1 to n are present in the array?
If the array contains all consecutive positive integers from 1 to n, the solution will return n + 1, where n is the size of the array, as this would be the next smallest missing positive integer.

11. Why can’t we use additional data structures like hash sets in the optimal solution?
Using additional data structures like hash sets would increase the space complexity to O(n), which violates the problem’s constraint of using O(1) extra space. The optimal solution achieves efficiency by manipulating the array in place.

12. What are some common edge cases to consider?
Common edge cases include an empty array, an array with no positive numbers, an array with all positive numbers in sequence, and arrays with large gaps between positive numbers.

13. Can this problem be solved using a hash table?
Yes, the problem can be solved using a hash table to track the presence of each positive number. However, this approach uses O(n) extra space, which is less optimal compared to the in-place approach that uses O(1) space.

14. Is the “First Missing Positive” problem applicable in real-world scenarios?
Yes, the concepts of finding missing elements in sequences or datasets are common in various applications, such as database management, inventory tracking, and data analysis. Understanding how to efficiently find missing elements is valuable in these contexts.

15. What is the key takeaway from solving the “First Missing Positive” problem?
The key takeaway is the importance of optimizing both time and space complexity in algorithm design. This problem teaches how to efficiently manipulate arrays and handle edge cases, which are crucial skills in software development and algorithmic problem-solving.

Also Read: Two Sum II – Input Array Is Sorted | Two Sum II – Input Array Is Sorted LeetCode Solution in Java

Select a plan

Monthly plan

Yearly plan

All plans include

Search for an article