Skip to main content

Notice: this Wiki will be going read only early in 2024 and edits will no longer be possible. Please see: https://gitlab.eclipse.org/eclipsefdn/helpdesk/-/wikis/Wiki-shutdown-plan for the plan.

Jump to: navigation, search

Difference between revisions of "Google Summer of Code 2015 Ideas"

(GeoTrellis: Cassandra Backend to GeoTrellis-Spark)
(GeoTrellis: Cassandra Backend to GeoTrellis-Spark)
Line 50: Line 50:
 
* Potential students: Pawel Nowak (list closed)
 
* Potential students: Pawel Nowak (list closed)
  
=== [http://www.locationtech.org/projects/technology.geotrellis GeoTrellis]: Cassandra Backend to GeoTrellis-Spark  ===
+
=== [http://geotrellis.io GeoTrellis]: Cassandra Backend to GeoTrellis-Spark  ===
  
GeoTrellis is a [http://scala-lang.org Scala]-based [http://www.locaionttech.org LocationTech] project that is a framework for fast, parallel processing of raster data in the geospatial domain. Recent development efforts have allowed GeoTrellis to give the [https://spark.apache.org/ Apache Spark] cluster compute engine geospatial capabilities, focusing on the processing of large scale [http://webhelp.esri.com/arcgisdesktop/9.2/index.cfm?TopicName=What_is_raster_data%3F raster data].
+
GeoTrellis is a [http://scala-lang.org Scala]-based [http://www.locaionttech.org LocationTech] project that is a framework for fast, parallel processing of geospatial data. Recent development efforts have allowed GeoTrellis to give the [https://spark.apache.org/ Apache Spark] cluster compute engine geospatial capabilities, focusing on the processing of large scale [http://webhelp.esri.com/arcgisdesktop/9.2/index.cfm?TopicName=What_is_raster_data%3F raster data].
  
 
GeoTrellis's integration to spark currently supports [http://hadoop.apache.org/docs/r1.2.1/hdfs_design.html HDFS] and [https://accumulo.apache.org/ Accumulo] as backends to store and retrieve raster data across a cluster. [http://cassandra.apache.org/ Cassandra] is another distributed data store that could provide a rich set of features and performance opportunities to GeoTrellis running on top of Spark. It's also a popular distributed data store that a number of people interested in doing large scale geospatial computations are already using.
 
GeoTrellis's integration to spark currently supports [http://hadoop.apache.org/docs/r1.2.1/hdfs_design.html HDFS] and [https://accumulo.apache.org/ Accumulo] as backends to store and retrieve raster data across a cluster. [http://cassandra.apache.org/ Cassandra] is another distributed data store that could provide a rich set of features and performance opportunities to GeoTrellis running on top of Spark. It's also a popular distributed data store that a number of people interested in doing large scale geospatial computations are already using.
Line 62: Line 62:
 
'''Possible Mentors:''' Rob Emanuele, Eugene Cheipesh
 
'''Possible Mentors:''' Rob Emanuele, Eugene Cheipesh
  
'''Interested Students:'''  
+
'''Interested Students:'''
 +
 
 +
=== [http://geotrellis.io GeoTrellis]: Computer Vision algorithms on Satellite Imagery  ===
 +
 
 +
GeoTrellis is a [http://scala-lang.org Scala]-based [http://www.locaionttech.org LocationTech] project that is a framework for fast, parallel processing of raster data in the geospatial domain. One application of GeoTrellis is to process imagery collected by satellites and other remote sensing sources.
 +
 
 +
Computer vision (CV) algorithms can be applied during satellite imagery processing to detect and extract geometries such as trees or buildings (also known as image segmentation), or to remove clouds and mosaic images of the same area over time to create the most clear images of land. This project would include implementing one or more CV algorithms in Scala using the GeoTrellis library for the purpose of processing satellite imagery. There will be a focus on parallel algorithms, so that these processes can work on large scale satellite imagery on a cluster.
 +
 
 +
Please get in touch if you are interested in this project. You can reach out to the [https://groups.google.com/forum/#!forum/geotrellis-user mailing list] or email Rob Emanuele at remanuele@azavea.com.
 +
 
 +
'''Possible Mentors:''' Rob Emanuele, Eugene Cheipesh
 +
 
 +
'''Interested Students:'''
  
 
=== Other Sources of Ideas ===
 
=== Other Sources of Ideas ===
  
 
There are numerous bugs in the Eclipse Bugzilla issue tracking system marked as "[https://bugs.eclipse.org/bugs/buglist.cgi?bug_status=NEW&bug_status=REOPENED&columnlist=bug_severity%2Cpriority%2Cop_sys%2Cassigned_to%2Cbug_status%2Cresolution%2Cshort_desc%2Cvotes&field0-0-0=classification&field0-1-0=short_desc&field0-1-1=short_desc&keywords=helpwanted&keywords_type=allwords&negate0=1&query_format=advanced&type0-0-0=equals&type0-1-0=allwords&type0-1-1=allwords&value0-0-0=Mylyn&value0-1-0=%5Bconnector%5D&value0-1-1=%5Bbridge%5D helpwanted]" that may give you ideas.
 
There are numerous bugs in the Eclipse Bugzilla issue tracking system marked as "[https://bugs.eclipse.org/bugs/buglist.cgi?bug_status=NEW&bug_status=REOPENED&columnlist=bug_severity%2Cpriority%2Cop_sys%2Cassigned_to%2Cbug_status%2Cresolution%2Cshort_desc%2Cvotes&field0-0-0=classification&field0-1-0=short_desc&field0-1-1=short_desc&keywords=helpwanted&keywords_type=allwords&negate0=1&query_format=advanced&type0-0-0=equals&type0-1-0=allwords&type0-1-1=allwords&value0-0-0=Mylyn&value0-1-0=%5Bconnector%5D&value0-1-1=%5Bbridge%5D helpwanted]" that may give you ideas.

Revision as of 18:14, 20 February 2015

Existing bugs marked "helpwanted" are a good source of project ideas.
Warning2.png
Feel free to contribute the discussion on an Eclipse bug. Keep the discussion on bugs technical. The bugs are not a good place to talk about Google Summer of Code participation. Use the soc-dev mailing list for that.


Rules

  • Be creative
  • Be specific: what do you want to be implemented
  • If you are willing to mentor those ideas, add your name and email to the idea.
  • GSoC project ideas should align with an existing Eclipse project
  • If you're an interested student, add your name and email next to the idea. It is ok to have several students interested in one idea.
  • Aspiring students and mentors need to register and submit their proposals on Google´s Melange

Mentors info

If you were a mentor last year then you are automatically in the list this year (the GSoC site may require that you re-register, but we think of you as "in").

Note that we only accept as mentors people who are known to us. This includes Eclipse committers. If you would like to be a mentor, please either introduce yourself to the group using the soc-dev mailing list, or send a note to EMO.

Ideas submission

Idea proposal should contain the following information:

  • project title, like "WTP - Improve auto-complete in xml editor"
  • description with links to bug reports, project wiki pages, etc
  • Reporter: who submitted idea (optional e-mail)
  • Possible Mentors: who would like to mentor the students
  • More info: other links or e-mail
  • Eclipse Project: link to main eclipse project that improvement is targeting
  • Potential students: who is interested (with optional e-mail). This one completely informal, to actually be interested you need to submit the proposal. Contact the idea owner or possible mentor to get some details before submitting it.

Ideas

These are some ideas. Students feel free to base your GSoC proposals on these ideas (note that you are more likely to find a mentor for an idea that has been proposed by a mentor). Some of these ideas can be factored into multiple projects; a GSoC project proposal can work on parts of these ideas (i.e. you don't necessarily have to do it all).

Error Reporting - Mylyn-like error notifications and other UI candies

Project News Feeds for / in Eclipse

  • Description: News feed aggregator which presents project-specific news in the IDE. It will allow projects to inform it's users (and only it's users) about new releases or anything related (e.g. security issues) as well as polls (e.g. which notification style to use) and more. The purpose is to provide better communication and feedback between projects and it's users.
  • Reporter: Marcel Bruch
  • Possible Mentors: Johannes Dorn, Marcel Bruch
  • More Info: See discussions at ide-dev Bug 459911
  • Eclipse Project: To be decided
  • Potential students: Pawel Nowak (list closed)

GeoTrellis: Cassandra Backend to GeoTrellis-Spark

GeoTrellis is a Scala-based LocationTech project that is a framework for fast, parallel processing of geospatial data. Recent development efforts have allowed GeoTrellis to give the Apache Spark cluster compute engine geospatial capabilities, focusing on the processing of large scale raster data.

GeoTrellis's integration to spark currently supports HDFS and Accumulo as backends to store and retrieve raster data across a cluster. Cassandra is another distributed data store that could provide a rich set of features and performance opportunities to GeoTrellis running on top of Spark. It's also a popular distributed data store that a number of people interested in doing large scale geospatial computations are already using.

This project would include implementing a GeoTrellis Catalog implementation for Cassandra, which allows us to save and load raster layers as Spark RDD's, as well as metadata. Eventually we will also be storing vector data in these data stores, and depending on how far we are in the development of that functionality, this project should support the storing and saving of vector data as well.

Please get in touch if you are interested in this project. You can reach out to the mailing list or email Rob Emanuele at remanuele@azavea.com.

Possible Mentors: Rob Emanuele, Eugene Cheipesh

Interested Students:

GeoTrellis: Computer Vision algorithms on Satellite Imagery

GeoTrellis is a Scala-based LocationTech project that is a framework for fast, parallel processing of raster data in the geospatial domain. One application of GeoTrellis is to process imagery collected by satellites and other remote sensing sources.

Computer vision (CV) algorithms can be applied during satellite imagery processing to detect and extract geometries such as trees or buildings (also known as image segmentation), or to remove clouds and mosaic images of the same area over time to create the most clear images of land. This project would include implementing one or more CV algorithms in Scala using the GeoTrellis library for the purpose of processing satellite imagery. There will be a focus on parallel algorithms, so that these processes can work on large scale satellite imagery on a cluster.

Please get in touch if you are interested in this project. You can reach out to the mailing list or email Rob Emanuele at remanuele@azavea.com.

Possible Mentors: Rob Emanuele, Eugene Cheipesh

Interested Students:

Other Sources of Ideas

There are numerous bugs in the Eclipse Bugzilla issue tracking system marked as "helpwanted" that may give you ideas.

Back to the top