Tweaked boilerplate: replicable -> reproducible (#1538)
lintool authored May 1, 2021
1 parent f5876f9 commit 24198e4
Showing 297 changed files with 814 additions and 446 deletions.
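The commit does not say how the header text was changed across so many files. As a rough illustration only, a bulk edit of this kind could be scripted along the following lines. The class name `HeaderRewriter`, its methods, and the default `src` root are hypothetical and not part of Anserini; the two header strings are copied from the diff below.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Hypothetical helper, not part of Anserini: rewrites the license header tagline.
public class HeaderRewriter {
  static final String OLD_TAGLINE =
      "Anserini: A Lucene toolkit for replicable information retrieval research";
  static final String NEW_TAGLINE =
      "Anserini: A Lucene toolkit for reproducible information retrieval research";

  // Returns the source text with the old tagline replaced by the new one.
  // Text that does not contain the old tagline is returned unchanged.
  static String rewrite(String source) {
    return source.replace(OLD_TAGLINE, NEW_TAGLINE);
  }

  // Walks the tree under root, rewrites every regular .java file that
  // contains the old tagline, and returns the number of files modified.
  static int rewriteTree(Path root) throws IOException {
    List<Path> files;
    try (Stream<Path> paths = Files.walk(root)) {
      files = paths
          .filter(Files::isRegularFile)
          .filter(p -> p.toString().endsWith(".java"))
          .collect(Collectors.toList());
    }
    int changed = 0;
    for (Path file : files) {
      String before = new String(Files.readAllBytes(file), StandardCharsets.UTF_8);
      String after = rewrite(before);
      if (!after.equals(before)) {
        Files.write(file, after.getBytes(StandardCharsets.UTF_8));
        changed++;
      }
    }
    return changed;
  }

  public static void main(String[] args) throws IOException {
    // The root directory defaults to "src"; this default is an assumption.
    Path root = Paths.get(args.length > 0 ? args[0] : "src");
    System.out.println("Rewrote " + rewriteTree(root) + " files under " + root);
  }
}
```

Keeping `rewrite` as a pure string function makes the replacement easy to check in isolation before touching any files.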
2 changes: 1 addition & 1 deletion docs/experiments-jdiq2018.md
@@ -23,7 +23,7 @@ Regression effectiveness values changed at commit [`e71df7a`](https://github.com
**Update (6/12/2019)**:
With commit [`75e36f9`](https://github.com/castorini/anserini/commit/75e36f97f7037d1ceb20fa9c91582eac5e974131), which upgrades Anserini to Lucene 8.0, we are no longer maintaining the reproducibility of these experiments.
That is, running these commands will produce results different from the numbers reported here.
The most recent version in which these results are replicable is the [v0.5.1](https://github.com/castorini/anserini/releases) release (6/11/2019).
The most recent version in which these results are reproducible is the [v0.5.1](https://github.com/castorini/anserini/releases) release (6/11/2019).

## Parameter Tuning

2 changes: 1 addition & 1 deletion src/main/java/io/anserini/analysis/AnalyzerUtils.java
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -17,7 +17,6 @@
package io.anserini.analysis;

import org.apache.commons.io.FileUtils;
import org.apache.lucene.analysis.Analyzer;
import org.apache.lucene.analysis.CharArraySet;
import org.apache.lucene.analysis.LowerCaseFilter;
import org.apache.lucene.analysis.StopFilter;
2 changes: 1 addition & 1 deletion src/main/java/io/anserini/analysis/TweetAnalyzer.java
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
2 changes: 1 addition & 1 deletion src/main/java/io/anserini/ann/FeatureVectorsTokenizer.java
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
2 changes: 1 addition & 1 deletion src/main/java/io/anserini/ann/IndexVectors.java
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
2 changes: 1 addition & 1 deletion src/main/java/io/anserini/collection/AclAnthology.java
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
2 changes: 1 addition & 1 deletion src/main/java/io/anserini/collection/BibtexCollection.java
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
14 changes: 8 additions & 6 deletions src/main/java/io/anserini/collection/C4Collection.java
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -20,19 +20,21 @@
import com.fasterxml.jackson.databind.MappingIterator;
import com.fasterxml.jackson.databind.ObjectMapper;

import java.io.*;
import java.io.BufferedReader;
import java.io.FileInputStream;
import java.io.FilterInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;
import java.text.DateFormat;
import java.text.SimpleDateFormat;
import java.time.Instant;
import java.util.List;
import java.util.NoSuchElementException;
import java.util.stream.Collectors;
import java.util.zip.GZIPInputStream;
import java.util.Date;
import java.time.Instant;


public class C4Collection extends DocumentCollection<C4Collection.Document> {
2 changes: 1 addition & 1 deletion src/main/java/io/anserini/collection/CarCollection.java
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -12,7 +12,6 @@
* WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
* See the License for the specific language governing permissions and
* limitations under the License.
*
*/

package io.anserini.collection;
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -12,7 +12,6 @@
* WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
* See the License for the specific language governing permissions and
* limitations under the License.
*
*/

package io.anserini.collection;
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -12,7 +12,6 @@
* WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
* See the License for the specific language governing permissions and
* limitations under the License.
*
*/

package io.anserini.collection;
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
2 changes: 1 addition & 1 deletion src/main/java/io/anserini/collection/CoreCollection.java
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -27,7 +27,13 @@
import java.io.FileReader;
import java.io.IOException;
import java.nio.file.Path;
import java.util.*;
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;
import java.util.Map;
import java.util.NoSuchElementException;
import java.util.Objects;
import java.util.Set;
import java.util.stream.Stream;

public class FeverSentenceCollection extends DocumentCollection<FeverSentenceCollection.Document> {
2 changes: 1 addition & 1 deletion src/main/java/io/anserini/collection/FileSegment.java
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
2 changes: 1 addition & 1 deletion src/main/java/io/anserini/collection/HtmlCollection.java
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
3 changes: 1 addition & 2 deletions src/main/java/io/anserini/collection/Iso19115Collection.java
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -22,7 +22,6 @@

import java.io.BufferedReader;
import java.io.FileReader;
import java.lang.StringBuilder;
import java.io.IOException;
import java.nio.file.Path;
import java.util.Arrays;
2 changes: 1 addition & 1 deletion src/main/java/io/anserini/collection/JsonCollection.java
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
2 changes: 1 addition & 1 deletion src/main/java/io/anserini/collection/SourceDocument.java
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
2 changes: 1 addition & 1 deletion src/main/java/io/anserini/collection/StringTransform.java
@@ -1,5 +1,5 @@
/*
* Anserini: A Lucene toolkit for replicable information retrieval research
* Anserini: A Lucene toolkit for reproducible information retrieval research
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.