Expeditious data recovery in Not Only SQL (NoSQL) databases using a log-based approach

Gayatri Shitanshu Kapadia; Rustom Darash Morena

doi:10.7717/peerj-cs.3507

Expeditious data recovery in Not Only SQL (NoSQL) databases using a log-based approach

Gayatri Shitanshu Kapadia ¹, Rustom Darash Morena²

1M.C.A. Department, Sarvajanik College of Engineering and Technology, Surat, Gujarat, India

2Department of Computer Science, Veer Narmad South Gujarat University, Surat, Gujarat, India

DOI: 10.7717/peerj-cs.3507

Published: 2026-02-17
Accepted: 2025-12-01
Received: 2025-04-21

Academic Editor: Siddhartha Bhattacharyya

Subject Areas: Algorithms and Analysis of Algorithms, Computer Education, Databases, Blockchain
Keywords: Journal, NoSQL databases, Recovery, Oplog, Replication, Restoration, MongoDB, Transaction, Journal, MongoDB

Licence: This is an open access article, free of all copyright, made available under the Creative Commons Public Domain Dedication. This work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose.

Cite this article: Kapadia GS, Morena RD. 2026. Expeditious data recovery in Not Only SQL (NoSQL) databases using a log-based approach. PeerJ Computer Science 12:e3507 https://doi.org/10.7717/peerj-cs.3507

The authors have chosen to make the review history of this article public.

Abstract

Not Only SQL (NoSQL) databases lack standardized backup and recovery procedures, making fault tolerance and rapid restoration more challenging than in traditional relational systems. Achieving efficient recovery in distributed NoSQL environments requires mechanisms that can manage operational failures while preserving availability, consistency, and performance. This study investigates existing recovery strategies for NoSQL databases and introduces a novel log-based expeditious recovery mechanism tailored for distributed architectures. Our primary contribution is a transaction evaluator and cleaner system that performs real-time inspection of operational logs, identifies failed or inconsistent transactions, and prevents their propagation from the primary node to secondary replica nodes. Unlike conventional log-shipping and replication approaches, the proposed method integrates selective log forwarding with automated transaction sanitization, thereby reducing replica divergence, data loss, and overall recovery time. We implemented the system using Node.js and MongoDB Atlas, incorporating modules for log management, collection operations, and a lightweight journaling process for secondary nodes. Experimental evaluation shows that our approach improves recovery speed and reliability compared to standard replication-based techniques. These findings demonstrate a practical and robust solution for accelerating recovery in NoSQL databases, contributing meaningful advancement to failure-recovery research in distributed data systems.

Introduction

Not Only SQL (NoSQL) databases lack the standardized backup and recovery procedures found in relational databases, resulting in significant challenges. Accurately reflecting data at a specific point in time is problematic, especially with large data volumes that require extensive resources for backups. Recovery processes can be further hindered by potential downtime and performance issues, as NoSQL databases prioritize availability and partition tolerance over strict consistency, complicating the achievement of a consistent state across all nodes. Additionally, the risk of stale data necessitates conflict resolution.

To overcome potential downtime and performance issues, a proposed methodology involves a transaction evaluator and cleaner system built in Node.js and integrated with MongoDB Atlas. Out of various NoSQL databases, MongoDB is a prominent NoSQL database with 30% to 40% market share and a compound annual growth rate (CAGR) of 23.9% (Ref. Market Share, 2024). Hence, we selected MongoDB amongst the various NoSQL databases to implement our proposed system, which in turn can be applied to other NoSQL databases. This system uses MongoDB data structures like Journal File, Operation Log (Oplog), and Checkpoint, which play crucial roles in transaction evaluation. An oplog is a log that records all operations applied to a database. A journal is a write-ahead log (WAL) used for crash recovery. A checkpoint is a point where the database flushes recent changes from memory (cache) to disk. The Oplog, journal, and checkpoint are discussed later in this section.

The proposed system includes scripts for critical tasks such as connecting to databases, managing collections, and handling operational logs while replicating the database operation(s) of successful transactions from the primary node to secondary node(s). We have written two algorithms, one of which is a transaction evaluator and cleaner that bifurcates successful and failed transactions and saves successful transactions in a newly created oplog. Another algorithm is to minutely work on identifying the successful transactions, which is a subpart of the first algorithm. By providing newly created operation logs and incorporating journaling during various database operations at replica set levels, the proposed method seeks to ensure reliable and efficient restoration of the database in less time than the normal restore process. The proposed system achieves minimum downtime in the recovery process with no data loss.

MongoDB, which operates in a distributed environment, has the following key components, which are part of our proposed system.

Primary node

In a MongoDB replica set, the primary node acts as the reliable source of data for the set. It is the sole node in the replica set authorized to handle write operations. It’s the sole node that can handle insert, update, and delete commands. The primary holds an operation log (oplog), a unique capped collection that keeps all changes to data.

Oplog file

The oplog is a special capped collection that is used to percolate all changes to the data from the primary node to secondary nodes in the MongoDB replica set. Each member of the replica set applies the operations from the oplog to maintain consistency across the set. The oplog is crucial for monitoring and recovering transactions in a distributed environment. It ensures that transactions are consistently applied across all nodes in the replica set. In the event of a failure, the oplog can help update a secondary node to match the primary, guaranteeing accurate replication of the transaction. Figure 1 presents the general format of the oplog file, highlighting key fields such as:

ts: timestamp indicating when the operation occurred
h: history or hash identifier
v: oplog version
op: type of operation (insert, update, delete, command, etc.)
ns: the namespace indicating the target database and collection
o, o2: detailed operation objects.

Figure 1: Oplog file format.

Download full-size image

DOI: 10.7717/peerj-cs.3507/fig-1

Figure 2 shows sample oplog entries for insert and update operations. The examples illustrate how MongoDB records each action, allowing secondary nodes to reproduce the exact sequence of operations. These visual samples make it easier to understand how MongoDB internally represents state changes and how recovery processes interpret them.

Figure 2: Oplog example.

Download full-size image

DOI: 10.7717/peerj-cs.3507/fig-2

Secondary node

In a MongoDB replica set, a secondary node copies data from the primary node by applying operations from the primary’s oplog asynchronously. Secondary nodes replicate the oplog and execute the operations on their datasets, maintaining data consistency throughout the replica set.

Journal file

A journal file serves as a write-ahead log that MongoDB employs to keep a record of actions that alter the data. It functions as a mediator between operations in memory and permanent storage (data files on the disk). Writes to the journal file occur in memory and are periodically flushed to disk. Figure 3 illustrates the structure of the journal file, including components such as: the Header for the file identifier, MongoDB version, and checksum. All of these are useful during data integrity verification. Journal file format also contains journal sections for write operations’ timestamp, database & collection name, operation type, and file offset. So, MongoDB uses this file to bring the database to a stable state by reapplying all uncommitted or incomplete transactions. The expanded visual representation in Fig. 3 clarifies how journal entries are grouped and replayed.

Data files

These files store the actual data of MongoDB collections. Each collection has a file with a “.wt” extension, which contains the BSON data for documents. During transactions, the data in these files is modified.

Checkpoint

A checkpoint is a mechanism that periodically flushes the in-memory changes made by write operations to disk. This guarantees that all operations up to a certain point are safely written and stored in the data files, providing a durability guarantee for the system.

The article further outlines a literature study, methodology, experiments, results, and conclusive insights.

Literature study

This section reviews the previous research to compare recovery mechanisms in NoSQL databases, fostering ideas for novel solutions that improve recovery time.

Several researchers have explored data recovery and resilience strategies in NoSQL systems, contributing valuable insights while also highlighting certain limitations in applicability and scalability. Tsai, Hsiao & Lai (1959) investigated multiple data recovery techniques aimed at enhancing resilience and minimizing data loss within NoSQL environments. While their work presents a diverse range of solutions, it lacks validation through real-world testing, which raises concerns about the generalizability of their approaches across heterogeneous NoSQL systems. Sauer, Graefe & Härder (2019) through the FineLine framework, proposed mechanisms to simplify transactional storage and recovery in the context of large datasets and complex structural changes. However, their solution appears to be highly specialized, potentially limiting its applicability beyond specific NoSQL architectures. Luo (2012) addressed interference issues during checkpointing in distributed systems, proposing enhancements to system consistency. Although conceptually robust, the implementation complexity in large-scale systems and insufficient discussion on integration with existing infrastructures present practical limitations. Paithankar & Roesle (2023) focused on pragmatic strategies for data resilience, advocating for recovery process testing and leveraging multi-cloud region deployments. Nonetheless, their strategies are primarily tailored to MongoDB Atlas, restricting broader applicability to other NoSQL platforms. Matos & Correia (2016) introduced an “undo” operation to bolster data integrity and consistency in MongoDB. Despite its innovation, the approach remains narrowly scoped to MongoDB with limited discussion on scalability or adaptability to other NoSQL databases. da Silva, Holanda & Araujo (2016) explored challenges in replication, emphasizing consistency and storage efficiency within distributed systems. However, their work lacks concrete real-world case studies and does not deeply explore the trade-offs between consistency and availability. Wang & Tang (2012) presented foundational concepts for understanding scalable NoSQL frameworks, focusing particularly on Cassandra. While informative, the work is more introductory, offering limited technical depth concerning Cassandra’s internal mechanisms.

Comparative studies by Haughian, Osman & Knottenbelt (2016) and Gu et al. (2015) analyzed replication strategies in Cassandra and MongoDB, aiding database selection decisions. Nonetheless, their evaluations are primarily qualitative and do not provide quantitative metrics to assess system performance under varied workloads. Sung et al. (2019) proposed durability-enhancing techniques such as snapshots and transaction logs. These approaches, though theoretically sound, may face challenges in dynamic, high-transaction environments, particularly with respect to maintaining system performance.

Tam Vo et al. (2012) through the LogBase system, addressed the demands of write-intensive workloads in cloud-based applications like financial services. While effective in such contexts, the solution may face limitations when generalized to broader application domains. Ouyang et al. (2016) proposed a verification framework for ensuring transactional consistency in MongoDB. While useful, its MongoDB-specific design limits applicability to other NoSQL systems. Mansouri & Pathan (2020, 2021) introduced a communication-induced checkpointing algorithm that reduces the overhead associated with traditional checkpointing methods. However, the algorithm’s implementation is complex and may introduce new types of operational overhead. Ammann et al. (2002) contributed techniques to improve recovery from malicious transactions, enhancing system security. However, the narrow focus on security-driven failures may overlook other significant failure types, such as hardware faults or network partitions. Bao et al. (2016) conducted an empirical analysis of persistence mechanisms using snapshots and logs. Their findings are valuable in understanding failure scenarios, although edge cases and complex failure modes remain underexplored. Finally, Wang, Li & Lin (2007) proposed a disaster recovery mechanism tailored for volume replication systems. While the mechanism shows promise, it lacks comprehensive evaluation across diverse failure conditions and does not fully address performance trade-offs inherent in such systems.

While studying various research articles, we have observed that there are issues relevant to data consistency across the replica sets, managing the data structures with larger datasets, transaction robustness, and recovery speed during the entire recovery process. Not much work has been done in the context of faster recovery in distributed databases. Mission-critical applications deployed on distributed databases requiring 24 × 7 uptime cannot afford even a fraction of downtime, which can lead to financial loss or reputation damage. Thus, faster recovery is vital to ensure smooth transaction processing, especially in large-scale, real-time systems.

Proposed system

The proposed system has been implemented in MongoDB, a NoSQL database with a large market share across NoSQL databases. In MongoDB, when a write operation (insert, update, or delete) occurs, the write operation is applied to an in-memory buffer (cache) first for performance reasons. Before the operation is permanently written to the data files, it is written to the journal file to ensure durability and recovery in case of failure. The data is asynchronously flushed from memory to the main data files managed by the storage engine (like WiredTiger). For replica sets, the operation is logged in the oplog (Operation Log) to replicate the changes to secondary nodes. The entire sequence is shown in below Fig. 4, which illustrates the complete flow of this process—from the initial write request to journaling, oplog creation, secondary replication, and final acknowledgment.

Figure 4: Sequence diagram when write operation occurs.

Download full-size image

DOI: 10.7717/peerj-cs.3507/fig-4

In our proposed system, the transactions are monitored, and successful transactions are stored in a collection that is used for recovery at the time of failure. In the transaction evaluator and cleaner algorithm, the evaluator, as shown in Figs. 5 and 6.

Figure 5 presents the proposed system architecture, illustrating how write operations generated from multiple collections are processed through the primary node, recorded in the oplog, and then assessed by the transaction evaluator. The evaluator continuously scans the oplog entries to distinguish successful transactions from those that have been aborted due to concurrency conflicts, system interruptions, or incomplete operations. Only verified successful transactions are forwarded to the cleaner module, which removes redundant or invalid entries and prepares a filtered oplog. This filtered log forms a reliable source of truth for high-speed recovery because it eliminates unnecessary or faulty entries that could otherwise slow down reconstruction procedures.

Figure 6 provides the component diagram of the proposed system, offering a physical-layer view of how the modules interact within the database environment. The diagram highlights how the evaluator and cleaner components integrate with the primary node, journaling mechanism, and oplog subsystem. It also shows how the filtered oplog is persisted in a dedicated collection that can be immediately accessed during system failure. This design simplifies recovery by bypassing the need to process the full oplog, which may contain large volumes of aborted or irrelevant entries.

Together, Figs. 5 and 6 provide a detailed functional and structural representation of the proposed mechanism, clarifying how the system improves the speed and accuracy of data recovery in NoSQL databases.

Transaction evaluator and cleaner

To create filtered_oplog, which will be used to accelerate data recovery, an algorithm “to create filtered_oplog” is written to identify the successful transactions and store them in the filtered_oplog collection. The transaction evaluator and cleaner algorithm evaluates the transactions and cleans the dirty operations out of the oplog. This algorithm uses a linked list data structure to manage all the sets like transactions, sessions, database states, operations, and errors.

Transaction evaluator and cleaner algorithm, which is implemented at step 5 of algorithm to create filtered_oplog is as follows:

Algorithm::

To create filtered_oplog.

1 Definitions:

2 Let T represent the set of all transactions.

3 Let S represent the set of all sessions.

4 Let L represent the set of all logs.

5 Let D represent the set of all database states.

6 Let E represent the set of all errors.

7 Let O represent the set of operations, where

8 O = {o1, o2,…,on}.

9 Let R(O) represent the set of reverse operations corresponding to O.

10 Step 1: Initialize and start a connection to our

11 MongoDB instance

12 C ⊆ S

⟹

C represents the subset of sessions

13 with active connections to the database.

14 Step 2: Start a MongoDB session for handling

15 transactions

16 ∃ s ∈ S, t ∈ T : t is associated with session s.

17 Step 3: Implement the transaction logic, including

18 logging inverse operations

\forall

t ∈ T, ∃ Lt ⊆ L: Log inverse operations

⟹

20 Lt = {start_log, o1, o2, … , commit_log}.

21 Perform Database Operations:

22 Dt ⊆ D → Dt′ = Dt ∪ O

23 where Dt′ is the updated database state.

24 Commit or Abort:

25 If E =

\emptyset

, Commit t; Otherwise, Abort t.

26 Step 4: [Record oplog to filtered_oplog]

\forall

t ∈ T, ∃ Lt ⊆ L : Lt = {oplog, journal_log,

28 monitoring_log}.

29 If t succeeds:

30 Lt

∋

success_log.

31 If t aborts:

32 Lt

∋

error_log.

33 Step 5: [Filter functionality]

\forall

t ∈ T, if t is aborted, read Lt for inverse ope-rations R(O).

35 Find oplog entries:

36 ∃ f ⊆ Lt : f is the filtered_oplog for t.

37 Commit or Abort Modify:

38 If E =

\emptyset

, Commit R (O);

39 Otherwise, Abort R (O).

40 Step 6: [Perform undo for different operations]

\forall

o ∈ O, ∃ r ∈ R (O) : r is the inverse of o where o∈ {insert, update, delete}.

42 Step 7: [Ensure proper testing]

\forall

t ∈ T, R(O) → Dt′′

44 where Dt′′ = Dt

46 (validate data integrity after reversal).

47 Step 8: Exit

DOI: 10.7717/peerj-cs.3507/table-2

Algorithm:

Transaction evaluator and cleaner.

1 Input: O (Oplog)

2 Output: Successful transactions list cList (for recovery)

3 We are working within a universal set of oplog entries U, transactions T, and specific operation types {read, write, abort, commit}.

4 Step: 1 Define Sets and Initial States

5 O ⊆ U: The set of all entries in the oplog.

6 M ⊆ T: The set of transactions currently being processed.

7 cList, uList, wList, twList, tuList: Subsets of U, initially empty.

8 cList: Stores committed operations. (commit transaction list)

9 tuList: Stores oplog entries temporarily for read

10 operations. (temporary read list)

11 twList: Stores oplog entries temporarily for write operations. (temporary write operation list)

12 uList: Stores processed read operations after a commit. (read list)

13 wList: Stores oplog entries moved for certain write operations. (write operation list)

14 Step: 2 [Process Each Oplog Entry]

15 Iterate through o ∈ O. For each o:

16 Let op (o) ∈ {read, write, abort, commit} denote the operation type of o, and txn (o) ∈ T denote the transaction Ti associated with o.

17 Step: 3 [Check the type of operation op (o):]

18 If op (o) = write:

19 If txn (o) ∈ M:

20 wList ∪ {o} → wList

21 Otherwise:

22 twList ∪ {o} → twList

23 If op (o) = read:

24 If o ∉ tuList:

25 tuList ∪ {o} → tuList

26 If op (o) = abort:

27 tuList\tuList → tuList

28 (clear tuList)

29 If op (o) = commit:

30 wList ∪ twList → wList

31 (move entries from twList to wList)

32 twList\twList → twList (clear twList)

33 uList ∪ tuList → uList

34 (move entries from tuList to uList)

35 tuList\tuList → tuList (clear tuList)

36 cList ∪ {o} → cList (add o to cList)

37 Otherwise:

38 twList\twList → twList (clear twList)

39 Step: 4 [Proceed to the subsequent o ∈ O]

40 Step: 5 [Once all entries in O are processed]

41 wList\wList → wList (clear wList)

42 uList\uList → twList (clear uList)

43 Step: 6 [Return committed list to store in filtered_oplog]

44 return cList

45 Exit

DOI: 10.7717/peerj-cs.3507/table-3

Experiments and results

While generating experimental results for transaction evaluator and cleaner in MongoDB, we have simulated transactions, monitored their status, and evaluated the performance and reliability of our data recovery mechanism. The structured approach to achieve this is described as follows:

Environment setup

MongoDB cluster with primary and secondary nodes are used for simulating recovery operations. We have worked on MongoDB Atlas (Cloud database) in conjunction with MongoDB Compass, which has also helped us to perform various data analysis, querying, and management tasks through graphical environment.

We employed MongoDB Compass, integrated with MongoDB Atlas, to simulate and analyze database recovery operations. A MongoDB Atlas cluster was provisioned by creating an Atlas account and configuring the cluster with appropriate instance specifications and regional parameters. To ensure secure connectivity, the client machine’s IP address was added to the IP whitelist under the Network Access settings. For database authentication, a user was configured with administrative privileges, enabling read and write access to the cluster.

Connectivity between the application layer and the Atlas cluster was established using the official MongoDB Node.js driver. For local interaction and verification, MongoDB Compass was employed, using the provided connection string (URI) to interface with the remote Atlas cluster.

Recovery operations were simulated by focusing on the oplog-based replication mechanism. Specifically, we exported the oplog.rs collection from the local database, which contains a chronological record of write operations for replication purposes. Subsequently, custom scripts were developed to function as a transaction evaluator and cleaner, designed to identify and persist only those operations that were successfully committed before failure events.

Additionally, Node.js was utilized to facilitate real-time monitoring and evaluation of database activity. Leveraging MongoDB Change Streams, our implementation was capable of observing live changes within specified collections and executing callback functions in response to transactional events. This enabled precise tracking and evaluation of transaction consistency and accelerated recovery in a dynamic environment.

We have used a single MongoDB Cluster in two different user accounts to perform testing of our proposed method to recover the data and measure the time taken.

During recovery simulation, while executing our transaction evaluator and cleaner scripts using Node.js and MongoDB Atlas, we have performed experiments on 1.6 million transactions using our proposed method. These 1.6 million transactions are generated on MongoDB Compass from the databases available on the MongoDB Atlas cluster. We have generated the transactions and stored them as documents in a collection in MongoDB Compass using the insert methods. Further, these documents are used as input to filtered_oplog creation and consequently to transaction evaluator and cleaner scripts. We have also used the same 1.6 million transactions using the MongoDB restore utility. The results are as shown in Table 1, which shows the comparison of the Recovery Time taken by the proposed method and MongoDB Restore.

Table 1:

Comparison of recovery time.

Parameters	Data recovery using our methodology	Data recovery using MongoDB restore
Time taken for recovery of data	4.040100 (s.msµs)*	7.020070 (s.msµs)*

DOI: 10.7717/peerj-cs.3507/table-1

Note:

* s stands for seconds, ms stands for milliseconds (three digits), µs microseconds (three digits).

This experiment is exclusively executed, and the restoration process has been given high priority for execution, before recovery simulation. The recovery time difference between our methodology and MongoDB Restore varies only in microseconds every time we have executed the restoration process with the same set of transactions. Although the measured improvement of our method over MongoDB Restore appears in microsecond, this difference is significant in latency-sensitive, high-throughput systems. In environments that operate 24 × 7 and process millions of transactions per second, microsecond-level savings in recovery can reduce total downtime and accelerate recovery. This is particularly critical in applications such as financial services, e-commerce, and real-time analytics, where even minor recovery delays may result in service interruptions and revenue loss.

To confirm that the recovered data is consistent, we have applied the following data consistency accuracy formula.

$D a t a C o n s i s t e n c y A c c u r a c y = \frac{N u m b e r o f C o n s i s t e n t R e c o r d s a f t e r R e c o v e r y}{T o t a l D o c u m e n t s i n D a t a s e t} X 100 %$

$\begin{aligned} D a t a C o n s i s t e n c y A c c u r a c y & = \frac{1.6 m i l l i o n d o c u m e n t s}{1.6 m i l l i o n d o c u m e n t s} X 100 % \\ = 100 % . \end{aligned}$

The data consistency accuracy calculated by the above formula is 100%, which means we have total 1.6 million documents in data set and we have recovered the same number of documents i.e., 1.6 million by using our proposed system. This result highlights that faster recovery is not achieved at the expense of correctness. So, our proposed system accelerates data recovery without any data loss.

At the same time, we acknowledge that our current experiments were conducted on a single cluster under controlled workloads.

The proposed system is flexible and can be made compatible with other NoSQL databases like CouchDB, Cassandra and, Redis. Our algorithms and scripts can be implemented on these NoSQL databases due to similarity in their data structures used for recovery which provide opportunities for cross-platform applicability.

In Cassandra, the data structures which are used during data recovery are Commit log (like MongoDB journaling) for durability and the memTable (like MongoDB oplog) for storing database operations. Both the Commit log and the memTable are compatible with our proposed system. These similarities make it feasible to adapt the evaluator and cleaner scripts to different NoSQL environments. These structures closely resemble MongoDB’s journaling and oplog mechanisms, enabling straightforward adaptation of the proposed evaluator and cleaner scripts.

In CouchDB, write ahead log (WAL) is used for durability (like MongoDB Journaling), and append only storage design is used for all database operations. Both WAL and append only storage design are compatible with our proposed system. This design parallels MongoDB’s journaling and makes CouchDB compatible with filtered log–based recovery approaches.

In Redis, append only format (like MongoDB Journaling) and Redis Database Backup files (RDB) Snapshot (like MongoDB Oplog) are used and both are compatible with our proposed system. These features make Redis suitable for adopting filtered transaction logs to accelerate recovery.

These mechanisms share functional similarities with MongoDB’s journaling and oplog, suggesting that our transaction evaluator and cleaner scripts could be adapted with minimal modifications. In summary, our results demonstrate that the proposed system achieves accelerated recovery with complete data consistency. While the performance gains are expressed in microseconds, their significance is amplified in high-throughput and latency-sensitive applications where rapid recovery is essential. Furthermore, the flexibility of the approach points toward its applicability across other NoSQL platforms.

Conclusions

To achieve the research objective of accelerating data recovery and creating a robust recovery mechanism, our research work is wrapped up by emphasizing the subsequent achievements, such as monitoring and evaluating transactions by identifying successful and failed transactions using oplog and journal with the construction of a new oplog called filtered_oplog. We have developed an expeditious log-based data recovery methodology for NoSQL databases using filtered_oplog. We have validated the recovered data accuracy by applying data recovery accuracy formulae with verification and confirmation that no loss of data after implementing our methodology. Moreover, the additional overhead is very little compared to the time saved during the recovery process.

Our proposed method works with write-ahead log-like data structures without changing it. Additionally, our method is better than existing methods because it improves the recovery time. Although at a moment, the recovery time of our method minutely differs from that of the existing recovery utility. However, with the larger dataset, our method would have a bigger difference. While existing research explores a wide range of strategies for data recovery in NoSQL systems, such as log-based recovery, checkpointing, undo operations, and resilience practices in platforms like MongoDB Atlas, many of these approaches either focus on system-wide techniques or introduce added complexity and resource overhead. In contrast, the proposed expeditious log-based data recovery methodology introduces a lightweight and precise solution by constructing a filtered_oplog derived from MongoDB’s native oplog and journal files. This method not only differentiates between successful and failed transactions, but also ensures accurate data recovery without incurring additional system overhead. Unlike previous works, the proposed approach was quantitatively validated for accuracy, demonstrating complete data recovery and improved efficiency, thereby offering a more practical and resource-efficient solution tailored specifically for NoSQL environments.

The novelty of the proposed method lies in its ability to operate directly on existing write ahead log–like (WAL) structures without requiring any modification to the native logging mechanism of the database. Unlike traditional utility-based recovery approaches, which exhibit near-constant performance regardless of scale, our method shows increasing efficiency gains as dataset volume grows. While the difference in recovery time is modest for smaller datasets, the proposed approach shows progressively larger performance improvements with real-world, large-scale data.

Our proposed system is closely associated with the work of Matos & Correia (2016) and Ammann et al. (2002). Matos & Correia (2016) tool requires database downtime and the recovery process is slowed down with high-version of documents. Whereas our proposed system works with real time database, no downtime is required and the recovery process is expedited. The work of Ammann et al. (2002) presumes that the malicious transactions are pre-identified by a security mechanism, not by the recovery process. So, it is dependent on the security mechanism for the recovery process. However, the proposed work identifies such transactions well in advance to accelerate the recovery process.

Looking ahead, the methodology can be generalized to work across diverse NoSQL platforms by adopting universal log formats capable of accommodating multiple data models. This extension would support cross-platform resilience strategies and allow seamless integration with heterogeneous systems. Moreover, incorporating blockchain-backed logging could further enhance data integrity and ensure immutable, tamper-proof recovery records.

Overall, the enhanced analysis, precision filtering, and real-time operability introduced in this work constitute a substantial and novel contribution to the domain of NoSQL data recovery.

Supplemental Information